tags : Data Engineering

Incremental Batch Jobs

Meta

  • It’s a given that the inserts themselves use upserts or handle de-duplication in some way
  • It’s also a given that all of these jobs are idempotent by nature
  • There’s also CDC (change data capture) and SCD Type 2 (slowly changing dimensions)
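
As a rough illustration of the first two assumptions, here is a minimal sketch of an idempotent upsert using Python's built-in sqlite3 module. The table name `target`, its columns, and the helper `upsert_rows` are made up for this example, and the `ON CONFLICT ... DO UPDATE` syntax assumes SQLite 3.24 or newer; other databases use their own upsert syntax (for example `MERGE`).

```python
import sqlite3


def upsert_rows(conn, rows):
    """Insert (id, value) rows; an existing id is updated instead of duplicated."""
    conn.executemany(
        """
        INSERT INTO target (id, value) VALUES (?, ?)
        ON CONFLICT(id) DO UPDATE SET value = excluded.value
        """,
        rows,
    )
    conn.commit()


# Hypothetical target table, keyed on id so the upsert has a conflict target.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE target (id INTEGER PRIMARY KEY, value TEXT)")

batch = [(1, "a"), (2, "b")]
upsert_rows(conn, batch)
upsert_rows(conn, batch)  # replaying the same batch must not create duplicates

rows_after = conn.execute("SELECT id, value FROM target ORDER BY id").fetchall()
```

Because the write is keyed on `id`, re-running a failed or repeated batch leaves the table in the same state, which is what makes the job safe to retry.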

Patterns

Process Indicator / High Watermark pattern