tags : Data Engineering

User defined functions (UDF)

Syntax FAQ

Debugging

Query Explain

Optimizations

Pushdown

Others

Resources

polars, loops, partitions, delta merges and python memory management

Issues:

What to do?

  • We should focus more on reducing the memory load part of it, last resort if we’re not able to release memory we can restart and it’s a python/polars problem
  • Loading not needed data into memory is a delta-rs issue that we need to fix