Add support for Apache ORC format #4933
Replies: 3 comments 5 replies
-
I will explain why the encapsulation of Dwrf and ORC is halfway complete. the module the classes, for example, But for some other classes, they are not well designed. For example, let's see another example, We should complete this work in the same way as
-
We can start with item 1 by adding support for the decoders and types. Item 2 is a little risky, but we can give it a try.
-
I have completed almost everything proposed in this discussion. Please review this PR for me.
-
Hi Velox community,
We are from Meituan's big data team and are currently working on adding support for the Apache ORC format in Velox.
We have enhanced the ORC Reader/Writer based on the current implementation of dwio/dwrf. This primarily involves adding RLEv2 Encoder/Decoder, as well as column readers for date/string/timestamp/decimal types.
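To give a feel for what an RLEv2 decoder has to handle, here is a minimal sketch of decoding a single "Short Repeat" run, the simplest of the four RLEv2 sub-encodings in the Apache ORC specification. It is only an illustration: `decodeShortRepeat` is a hypothetical standalone helper, not a function from Velox or from our patch, and a real decoder would also need to cover the Direct, Patched Base, and Delta sub-encodings.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <vector>

// Hypothetical helper (not a Velox API): decodes one ORC RLEv2
// "Short Repeat" run that starts at buf[0].
// Header byte layout: 2 bits sub-encoding (0 = Short Repeat),
// 3 bits (value width in bytes - 1), 3 bits (repeat count - 3).
// The value follows the header in big-endian byte order.
std::vector<int64_t> decodeShortRepeat(
    const std::vector<uint8_t>& buf,
    bool isSigned) {
  if (buf.empty()) {
    throw std::invalid_argument("empty buffer");
  }
  const uint8_t header = buf[0];
  if ((header >> 6) != 0) {
    throw std::invalid_argument("not a Short Repeat run");
  }
  const std::size_t width = ((header >> 3) & 0x07) + 1; // 1..8 bytes
  const std::size_t count = (header & 0x07) + 3;        // 3..10 values
  assert(width >= 1 && width <= 8);
  if (buf.size() < 1 + width) {
    throw std::invalid_argument("truncated run");
  }

  // Assemble the big-endian value.
  uint64_t raw = 0;
  for (std::size_t i = 0; i < width; ++i) {
    raw = (raw << 8) | buf[1 + i];
  }

  // Signed columns store values zigzag-encoded.
  const int64_t value = isSigned
      ? static_cast<int64_t>(raw >> 1) ^ -static_cast<int64_t>(raw & 1)
      : static_cast<int64_t>(raw);

  return std::vector<int64_t>(count, value);
}
```

With the example from the ORC specification, the bytes `0x0a 0x27 0x10` read as unsigned decode to five copies of 10000.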
The current approach of incorporating Apache ORC into Dwrf seems to have some issues, especially since the encapsulation of Dwrf and ORC is only halfway complete. We have some ideas to optimize this part and would like to discuss them with the Velox community.
We hope to keep working with the Velox community on ORC support by enhancing, refactoring, and optimizing the implementation.