Skip to content

Use database to keep processing ledgers / lookup tables #141

@leewujung

Description

@leewujung

Right now the codebase uses use CSV files to pass information for the next flow run or downstream flows. These CSV files store filename and file start/end time on what files have already been processed, so that:

  • when the next same flow runs it will not reprocess data already processed
  • for downstream flows they can just grab already processed data.

These operations are likely better done with databases for robustness and efficiency, especially when there are multiple write operations on the cloud side.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels
    No fields configured for Feature.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions