feat: Rework of storage based broadcast join for more efficient buffering (prestodb#26041)
Make the presto-on-spark native broadcast join more memory- and
performance-efficient. This is done by:
* Allowing the broadcast join writer to separate serialized pages, so it
can buffer them instead of writing through directly for every input
vector batch.
* Adding file footer information that stores page sizes, which the reader
can use to give size estimates to the exchange client, allowing better
control of client memory.
* Making the broadcast exchange source async instead of a blocking IO
call, to leverage parallelism.
This makes the storage-based broadcast join shuffle:
* memory-bounded by the exchange's memory cap.
* 5x more performant.
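
The buffering and footer ideas above can be illustrated with a minimal
sketch. It is not the code in this change (the native implementation is
not shown here), and the file layout is an assumption made up for
illustration: page bytes first, then one little-endian uint64 size per
page, then a uint64 page count. The names BufferedPageWriter,
read_page_sizes and read_pages are hypothetical.

```python
import io
import struct


class BufferedPageWriter:
    """Buffers serialized pages and writes them out in batches.

    Hypothetical layout (an assumption, not the real file format):
    [page bytes ...][uint64 size per page ...][uint64 page count]
    """

    def __init__(self, out, flush_threshold_bytes):
        self._out = out
        self._threshold = flush_threshold_bytes
        self._pending = []        # pages buffered but not yet written
        self._pending_bytes = 0
        self._page_sizes = []     # sizes of all pages, kept for the footer

    def write_page(self, page):
        # Buffer the page; only touch the output once enough bytes pile up.
        self._pending.append(page)
        self._pending_bytes += len(page)
        self._page_sizes.append(len(page))
        if self._pending_bytes >= self._threshold:
            self._flush()

    def _flush(self):
        self._out.write(b"".join(self._pending))
        self._pending.clear()
        self._pending_bytes = 0

    def close(self):
        # Write any remaining pages, then the footer with the page sizes.
        self._flush()
        for size in self._page_sizes:
            self._out.write(struct.pack("<Q", size))
        self._out.write(struct.pack("<Q", len(self._page_sizes)))


def read_page_sizes(f):
    """Reads only the footer, so sizes are known before any page is read."""
    f.seek(-8, io.SEEK_END)
    (count,) = struct.unpack("<Q", f.read(8))
    f.seek(-8 * (count + 1), io.SEEK_END)
    return list(struct.unpack(f"<{count}Q", f.read(8 * count)))


def read_pages(f):
    """Yields pages one at a time, using the footer to find boundaries."""
    sizes = read_page_sizes(f)
    f.seek(0)
    for size in sizes:
        yield f.read(size)
```

With a layout like this, a reader could sum the sizes from the footer to
tell its consumer how many bytes to expect before fetching any page,
which is the kind of size estimate the second bullet describes.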
== NO RELEASE NOTE ==