tags : Data Engineering

User defined functions (UDF)

Resources

polars, loops, partitions, delta merges and python memory management

Issues:

What to do?

  • We should focus first on reducing the memory load. If we still can’t release memory, the last resort is to restart the process; that part is a Python/Polars problem
  • Loading data that isn’t needed into memory is a delta-rs issue that we need to fix
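
A minimal sketch of the "reduce the memory load" idea, using only the Python standard library: process one partition at a time, keep only a small summary, and drop the reference to the partition before loading the next one. `load_partition` and `process_partitions` are hypothetical names, and the partition here is just a list of integers standing in for a real Polars/Delta read. Whether the freed memory actually goes back to the operating system is not guaranteed, which is why restarting stays the last resort.

```python
import gc
import tracemalloc


def load_partition(partition_id: int, rows: int = 50_000) -> list:
    """Hypothetical stand-in for reading a single partition into memory."""
    start = partition_id * rows
    return list(range(start, start + rows))


def process_partitions(partition_ids) -> dict:
    """Process partitions one by one, keeping only a small summary of each."""
    summaries = {}
    for pid in partition_ids:
        data = load_partition(pid)
        summaries[pid] = sum(data)  # keep the aggregate, not the raw rows
        del data                    # drop the only reference to the partition
        gc.collect()                # optional: also clears reference cycles
    return summaries


# Measure Python-level allocations while the loop runs.
tracemalloc.start()
summaries = process_partitions(range(4))
current_bytes, peak_bytes = tracemalloc.get_traced_memory()
tracemalloc.stop()

print(f"partitions processed: {len(summaries)}")
print(f"traced memory now: {current_bytes} bytes, peak: {peak_bytes} bytes")
```

`tracemalloc` only sees allocations made through Python's allocator, so memory held by native code (for example inside Polars or delta-rs) would not show up in these numbers; treat it as a rough check on the Python side only.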