Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
261 commits
Select commit Hold shift + click to select a range
ba8ca8b
feat: Add MakeRowFromMap Utility Class (#14671)
Sep 12, 2025
96e7cf6
docs: Fix install prefix to use $USER instead of $USERNAME for better…
PHILO-HE Sep 12, 2025
4258da0
refactor: Remove unused variable in coalesceIo() function (#14780)
lingbin Sep 12, 2025
d806574
feat: Add Iceberg truncate function (#14774)
jinchengchenghh Sep 12, 2025
505a600
feat: Add counter for nimble and dwrf writer (#14841)
skyelves Sep 12, 2025
059cd13
feat: Support Spark ArraySort with lambda function (#10138)
boneanxs Sep 12, 2025
1a5e0f7
fix(parquet): Write options incorrectly use session config (#14676)
jinchengchenghh Sep 12, 2025
07a4e28
fix(cuDF): Compilation failure (#14799)
rui-mo Sep 12, 2025
0997850
refactor: Remove remaining calls to registerConnectorFactory (#14848)
mbasmanova Sep 12, 2025
6c1b751
build: Fix QuantileDigest build error due to incorrect allocator type…
kou Sep 13, 2025
c02d0e9
refactor(buffer): Add verbose version of AlignedBuffer::allocate (#14…
sdruzkin Sep 13, 2025
71a3103
feat: Add filter support for index join which can't be converted into…
xiaoxmeng Sep 14, 2025
f190b53
refactor(expression): Formalize ExpressionRewrite framework (#14658)
pramodsatya Sep 15, 2025
de31a3e
feat: Add type support for TIME (#14836)
Sep 15, 2025
a03a134
feat(cudf): Support enable cudf in task level (#14216)
jinchengchenghh Sep 15, 2025
bcba5be
fix(build): Benchmark compilation (#14552)
czentgr Sep 15, 2025
eb59910
refactor: Extract serialization constants to a common file (#14863)
jagill Sep 15, 2025
8e2243e
build(ci): Bump pypa/gh-action-pypi-publish from 1.12.4 to 1.13.0 (#1…
dependabot[bot] Sep 15, 2025
6626f21
fix(build): ABFS connector install for macOS (#14852)
czentgr Sep 15, 2025
4354294
feat(readers): Cache row size estimates (#14854)
Sep 15, 2025
5b273c6
feat(readers): Customize column loader to track average row size (#14…
Sep 15, 2025
42f116d
feat(readers): Optionally disable row size tracking (#14857)
Sep 15, 2025
aaa19e5
build(ci): Bump actions/github-script from 7.0.1 to 8.0.0 (#14768)
dependabot[bot] Sep 16, 2025
8321974
feat: Add support for TIME in Simple Function Interface (#14808)
Sep 16, 2025
f1c2926
fix(misc): Mute previously introduced spammy log (#14871)
Sep 16, 2025
b3c1f38
build(pyvelox): Don't install pyvelox_libraries with VELOX_BUILD_PYTH…
kou Sep 16, 2025
b240d95
refactor: Reuse SplitReader::createRowReader in Meta internal SplitRe…
Yuhta Sep 16, 2025
b564fe9
refactor: Readability improvements, avoid few allocations (#14850)
MBkkt Sep 16, 2025
a20d361
build(ci): Bump actions/setup-python from 3.1.4 to 6.0.0 (#14769)
dependabot[bot] Sep 16, 2025
70fcb51
feat: Extend PlanConsistencyChecker to check AggregationNode (#14877)
mbasmanova Sep 16, 2025
6d0688b
build(ci): Bump actions/checkout from 3.6.0 to 5.0.0 (#14771)
dependabot[bot] Sep 16, 2025
2188fd5
build: Fix CuDF + Parquet test build failure (#14870)
kou Sep 17, 2025
4f96755
build: Remove needless `gflags::gflags` workaround (#14829)
kou Sep 17, 2025
2762233
build: Install missing header files by `cmake --install` (#14872)
kou Sep 17, 2025
5c91e83
feat(cudf): Support hash_with_seed in cudf (#14820)
jinchengchenghh Sep 17, 2025
3c975ab
feat(cudf): Large string support for importFromArrow (#14359)
karthikeyann Sep 17, 2025
749c995
misc: Deprecate old IndexLookupJoinNode constructor (#14881)
zacw7 Sep 18, 2025
b0d52b6
refactor(cudf): Add CudfHiveConnector (#14294)
devavret Sep 18, 2025
06ea190
feat(cudf): Support right join and right semi-join (#14441)
shrshi Sep 18, 2025
44b4711
refactor: Use 'cbegin()' to assign to 'const_iterator' (#14834)
lingbin Sep 18, 2025
d75b0fd
fix: Expose symbol tpcds::fromTableName (#14788)
pramodsatya Sep 18, 2025
e8260f4
fix: Make Spark abs function ansi-compliant (#14346)
nimesh1601 Sep 18, 2025
054cf93
fix(parquet): Read unannotated array (#13864)
rui-mo Sep 18, 2025
626e4b1
fix: Fix flaky ScaleWriterLocalPartitionTest.unpartitionBasic (#14892)
gggrace14 Sep 18, 2025
2226167
fix: avoid vector reallocation in Presto serializer (#14884)
marksantaniello Sep 18, 2025
c142946
build(ci): Bump astral-sh/setup-uv from 6.6.0 to 6.7.0 (#14860)
dependabot[bot] Sep 18, 2025
43c31f4
feat(abfs): Add DynamicSasTokenClientProvider to support dynamic SAS …
marin-ma Sep 18, 2025
85591fa
feat: Add support for session start time in QueryConfig (#14894)
Sep 18, 2025
2c1a1b0
refactor: Do not hold shared_ptr of the TaskQueue (#14896)
duanmeng Sep 19, 2025
53a24a8
feat: Templatize SparseHLL and DenseHLL memory (#13515)
natashasehgal Sep 19, 2025
c43eab5
feat: Add fuzzer generator for fb_dedup_normalize_text (#14824)
duxiao1212 Sep 19, 2025
bccd9ff
build: Use the official Snappy CMake package (#14564)
kou Sep 19, 2025
aacc023
fix(cuDF): Build failure caused by merging two PRs together (#14902)
devavret Sep 19, 2025
fe2cc17
fix(misc): Slightly improve row size tracking query config documentat…
Sep 19, 2025
6ca6272
misc: Remove doy alias for Spark dayofyear function (#14898)
zhli1142015 Sep 20, 2025
d96c655
docs(cuDF): First draft of readme (#14752)
GregoryKimball Sep 20, 2025
7d33a6a
feat(hdfs): Add close() API to disconnect the hdfs connection (#14665)
JkSelf Sep 20, 2025
a029fcf
test: Expand test coverage of public APIs for all types in ConstantTy…
kKPulla Sep 20, 2025
d8a7a5e
fix(hdfs): Nullify hdfsFile_ before checking success status (#14666)
boneanxs Sep 20, 2025
de60e06
feat: Add parallel unit loader in Hive and implement it for DWRF (#14…
kewang1024 Sep 22, 2025
602ffe1
feat(hive): Support decimal as partition key type in Hive write and r…
jinchengchenghh Sep 22, 2025
785b586
fix: Do not overwrite valid promise (#14923)
Yuhta Sep 22, 2025
e9ce22b
Replace boost::regex with RE2 regex in url_extract_parameter and fb_u…
abhinavmuk04 Sep 22, 2025
5ced240
feat: Pass requestOptions per I/O call (#14883)
vhsu14 Sep 22, 2025
426516c
test(geo): Add tests for different output batch sizes (#14924)
jagill Sep 22, 2025
27e9092
build: Fix TpcdsConnector error "'this' pointer is null" (#14878)
zhztheplayer Sep 23, 2025
7d7a91f
fix: ArrowArrayStruct contains null data pointer for a buffer with no…
wecharyu Sep 23, 2025
9578779
refactor(TopNRowNumber): Abstract computeNextRankInMemory(InSpill) fu…
aditi-pandit Sep 23, 2025
7c7484c
fix(TextReader): Fix incorret result for DECIMAL and DATE (#14832)
xin-zhang2 Sep 23, 2025
07ae1e8
fix: Reenable intermediate/final type validation for aggregate functi…
kletkavrubashku Sep 23, 2025
ccd8d14
feat: Add memory pool allocation sites printing in debug mode (#14926)
tanjialiang Sep 23, 2025
b3789e3
misc: tweak byteRangesFromIOBuf (#14922)
yfeldblum Sep 23, 2025
8cfcb29
fix(cuDF): Fix an exception when we get empty input for avg aggregati…
devavret Sep 23, 2025
62fe64a
fix(cudf): Fix table scan fallback condition and library (#14728)
jinchengchenghh Sep 23, 2025
b40a2d7
docs(hdfs): Update libhdfs3 and jvm libhdfs support (#14616)
JkSelf Sep 23, 2025
36037c9
feat: Add support for casting TIME to VARCHAR (#14887)
Sep 24, 2025
c3ff34d
feat: Add support for localtime (#14886)
Sep 24, 2025
fa3061e
Add insertTableHandle as input parameter to tableWrite (#14840)
emilysun201309 Sep 24, 2025
50175fc
feat: Allow transferring memory to a given memory pool (#14918)
kagamiori Sep 24, 2025
7f8d5a9
docs: Add Ke Jia to Storage Adapters maintainer list (#14938)
pedroerp Sep 24, 2025
bbb8dbe
fix: Support clustered raw input n EbfAggregator to optimize the memo…
xiaoxmeng Sep 24, 2025
73a9b28
misc(dwrf): Bump default DWRF flat map key limit to 30k (#14810)
sdruzkin Sep 25, 2025
c3c1b42
fix: Remove geometry from supported types for Presto Expr fuzzers (#1…
kKPulla Sep 25, 2025
9cce4aa
feat(type): Add support for signatures with homogeneous structs (#14675)
furmak331 Sep 25, 2025
71961ef
feat: Add PlanNode::findNodeById API (#14953)
mbasmanova Sep 25, 2025
a0eaea2
feat: Add PlanNode::is<T> and as<T> convenience APIs (#14954)
mbasmanova Sep 25, 2025
55bafa8
refactor: Delete deprecated Connector::supportsSplitPreload API (#14955)
mbasmanova Sep 25, 2025
563057c
refactor: Deprecate ConnectorFactory registry (#14957)
mbasmanova Sep 25, 2025
4b57386
feat: Implement ST_LineFromText, ST_LineString, ST_MultiPoint (#14705)
Sep 25, 2025
622ae59
refactor: Use ExprEnum enum check instead of cast in Expression.* (#1…
juwentus1234 Sep 26, 2025
8875b4d
feat: Report number of tiny/large entries (#14960)
zacw7 Sep 26, 2025
cccf849
feat: Centralize error propagation in SpillMerger (#14761)
duanmeng Sep 29, 2025
9dd3833
feat: Add support for casting TIME to BIGINT in Velox expressions (#1…
han-yan01 Sep 29, 2025
08f7340
fix: Data race inside tpch gen (#14984)
MBkkt Sep 30, 2025
59ab756
build: Fix unsupported bison syntax with bison 3.0.4 (#14961)
zhztheplayer Sep 30, 2025
7b7d6a1
feat: Allow transferring a vector to a given memory pool (#13814)
kagamiori Sep 30, 2025
204dd0a
fix: Register companion functions in registerTDigestAggregate (#14995)
MBkkt Sep 30, 2025
1c8743b
fix: Fix race in MemoryPool in debug mode (#14983)
kletkavrubashku Sep 30, 2025
90973bd
fix(cudf): Store memory resource mr_ to retain mr (#14933)
karthikeyann Sep 30, 2025
0e359bd
refactor: Enable remote function thrift client extensibility (#15001)
pedroerp Sep 30, 2025
75d32b2
feat: Add merge_hll native function (#14804)
natashasehgal Sep 30, 2025
198082a
fix: Add missing includes (#14980)
MBkkt Sep 30, 2025
6bbc936
test: Add IExpr::dropAlias API (#15007)
mbasmanova Oct 1, 2025
5793b33
refactor: Remove unused BaseVector::inMemoryBytes API (#14970)
xiaoxmeng Oct 1, 2025
13310e4
test(remote-function): Check for correct exception propagation (#15009)
pedroerp Oct 1, 2025
242fecf
fix(TextReader): Fix C++ structured binding lambda capture issue in T…
macvincent Oct 1, 2025
6998a14
qdigest_agg - Update intermediate type to VARBINARY (#15012)
natashasehgal Oct 1, 2025
d0179ac
fix(hive): Fix subfield filter extraction for AND/OR more than 2 argu…
Yuhta Oct 1, 2025
27e95e9
feat(geo): Add optional geometry and radius fields to SpatialJoinNode…
jagill Oct 1, 2025
c060962
feat: Add ignoreDuplicates flag to aggregate function metadata (#14976)
MBkkt Oct 1, 2025
904378e
refactor: Use VELOX_USER_RETURN in Spark arithmetic functions (#14936)
zhli1142015 Oct 1, 2025
22b75ac
fix: Generic Configuration section not shown (#15019)
juwentus1234 Oct 1, 2025
c623ead
feat: Change storage stats to per call instead per driver (#14950)
vhsu14 Oct 1, 2025
a34126b
misc: Remove unneeded use-count checking logic (#14621)
lingbin Oct 2, 2025
f4c116b
refactor: Extract default-value vector creation logic from MakeRowFro…
Oct 2, 2025
2e3fc87
fix: Disable intermediate/final type validation as they are not prod …
kletkavrubashku Oct 2, 2025
1dd71df
misc: Make serialized page file common utility (#14965)
tanjialiang Oct 2, 2025
a6fa169
feat: Add Spark months_between function (#14909)
zml1206 Oct 2, 2025
cd25622
refactor: Use std::move to avoid copy in HivePartitionFunction (#14944)
lingbin Oct 2, 2025
ba06895
misc: Optimize error handling in _VELOX_RETURN_IMPL (#14951)
zhli1142015 Oct 2, 2025
81116f5
fix: Fix incorrect initialization of empty 'folly::Promise' object in…
lingbin Oct 2, 2025
846b3dc
fix: Do not allow registering companion functions for TDigest aggrega…
MBkkt Oct 2, 2025
74466b4
feat: Add support for Time with Timezone type (#14999)
Oct 2, 2025
76492e9
feat(cudf): Disable async MR priming in Velox cuDF (#14997)
bdice Oct 2, 2025
ed9efc7
refactor(abfs): Extract the logic to get the accountName and authType…
JkSelf Oct 2, 2025
561d175
fix: MetadataFilter construction from flat (multi) And/Or (#14981)
MBkkt Oct 2, 2025
c20ed29
feat: Add basic support for writing to an Iceberg table (#14723)
PingLiuPing Oct 3, 2025
e9ef38b
fix(CacheInputStream): Prevent int32_t overflow in CacheInputStream::…
wutiangan Oct 3, 2025
012f801
feat: Add simple function support for TIME WITH TIME ZONE (#15024)
Oct 3, 2025
ecb2baa
feat: Add config option to toggle expr deduplication for non-determin…
kKPulla Oct 3, 2025
88b8d74
feat(cudf): Support round function (#14958)
jinchengchenghh Oct 3, 2025
68ce938
test(wave): Disable NimbleReaderTest.TrivialWithCompressionShouldFail…
Yuhta Oct 3, 2025
22fec44
feat(cudf): Add round robin batch support for Local partition in Velo…
karthikeyann Oct 3, 2025
2bc8c9d
feat: Add gh, jq to fedora (#15037)
Oct 3, 2025
2876bc4
feat: Add TIME support to fuzzer intermediate type transforms (#15000)
han-yan01 Oct 3, 2025
0a3736b
fix: Use correct C++ without UB (data race), that produce same or fas…
MBkkt Oct 3, 2025
5cb019d
Add array_subset UDF analogous to map_subset (#14862)
abhinavmuk04 Oct 4, 2025
e2ce8d3
fix(test): Fix AsyncDataCache mulithreading data race issue (#15032)
juwentus1234 Oct 4, 2025
9b1507c
fix: Remove the backward macro to fix presto failure (#15042)
xiaoxmeng Oct 4, 2025
0477c1c
fix: Fix velox exec cmake (#15048)
xiaoxmeng Oct 5, 2025
6dce9b2
refactor: Move driver creation into task init from the first next cal…
xiaoxmeng Oct 5, 2025
207c1b6
fix: Fix sxtream failed test by supporting task cancellation in test …
xiaoxmeng Oct 6, 2025
5a31ae3
fix(ssdcache): Write rate calculation (#15028)
lingbin Oct 6, 2025
a31539e
docs: Add new component and maintainer for LibcuDF bindings (#15057)
pedroerp Oct 6, 2025
78159fa
refactor: Remove legacy task spill code after update prestissimo (#15…
xiaoxmeng Oct 6, 2025
aa406cb
feat(cudf): Remove gflags in favor of CudfConfig (#14963)
majetideepak Oct 6, 2025
5c1dac4
fix: Fix the test task cancellation condition (#15065)
xiaoxmeng Oct 7, 2025
80be186
feat: Add Variant::toJson(Type) API that doesn't require shared_ptr (…
mbasmanova Oct 7, 2025
f266c84
docs: Fix iceberg functions formatting and some warnings (#14911)
majetideepak Oct 7, 2025
2113803
build(ci): Bump actions/cache from 4.2.4 to 4.3.0 (#14993)
dependabot[bot] Oct 7, 2025
21da671
fix(build): Bump Hadoop version for more reliable download (#14928)
simoneves Oct 7, 2025
d93affa
build: Add 'abseil' as a build dependency for re2 (#14849)
tdcmeehan Oct 7, 2025
81d943d
feat: Add IConfig interface (#14889)
codeworse Oct 7, 2025
7155c23
fix: Spark timestamp_seconds function (#14819)
rui-mo Oct 7, 2025
2af79be
feat: Add reading flatmap as struct feature in TableEvolutionFuzzer (…
juwentus1234 Oct 7, 2025
8a5c129
misc: Optimization - std::move in Driver Output (#15089)
natashasehgal Oct 8, 2025
8692dcc
docs: Fix formatting in presto/array.rst (#15074)
steveburnett Oct 8, 2025
31f4f00
docs: C++20 switch blog post (#14611)
czentgr Oct 8, 2025
e183b88
refactor: Remove usage of folly::StringPiece in DATE()->toDays() (#15…
pedroerp Oct 9, 2025
394eeb7
feat: Add support for timePlusIntervalDayToSecond and timePlusInterva…
pratikpugalia Oct 9, 2025
7317117
refactor(geo): Make Spatial NLJ logic more explicit (#15088)
jagill Oct 9, 2025
3c7ce8a
build: Add 'perf' type to conventional commits (#15104)
mbasmanova Oct 9, 2025
34c0286
fix(hdfs): Failing CI tests (#15100)
majetideepak Oct 9, 2025
208a140
fix: merge_hll decode elements (#15092)
natashasehgal Oct 9, 2025
f2d68e9
refactor: Remove usage of folly::StringPiece in RowType::findChild() …
pedroerp Oct 9, 2025
f18ba36
build(ci): Bump docker/login-action from 3.5.0 to 3.6.0 (#15055)
dependabot[bot] Oct 10, 2025
3dbf68a
build(ci): Bump astral-sh/setup-uv from 6.7.0 to 6.8.0 (#15054)
dependabot[bot] Oct 10, 2025
8e0b3a6
feat: Support row iterator for unsafe row to control memory usage for…
xiaoxmeng Oct 10, 2025
a3a47e0
opt: Optimize the sapphire cosco shuffle deserialization with batch p…
xiaoxmeng Oct 10, 2025
32cab94
refactor: Remove usage of folly::StringPiece in common/encode (#15114)
pedroerp Oct 10, 2025
ec4aa88
refactor: Simplify exchange is blocked condition proccessing (#15118)
xiaoxmeng Oct 10, 2025
4fb42d3
refactor: Remove usage of folly::StringPiece in HiveTypeParser (#15096)
pedroerp Oct 10, 2025
e120e32
refactor: Use weaker memory order for Buffer counter (#14987)
MBkkt Oct 10, 2025
ad4cc37
docs: Clarify documentation for from_unixtime(timestamp) Presto funct…
Oct 10, 2025
71315ab
Relax type comparison requirement (#15121)
Oct 10, 2025
c5c1216
refactor: Remove usage of folly::StringPiece in Variant (#15095)
pedroerp Oct 10, 2025
ed1b87f
docs: Update coding style guidance for folly::StringPiece (#15122)
pedroerp Oct 10, 2025
d92fb0f
feat: Add fedora runs to CI (#14575)
kgpai Oct 10, 2025
1e26a47
feat: Add support for intervalDayToSecondPlusTime and intervalMonthTo…
pratikpugalia Oct 10, 2025
206238a
refactor: Move folly::StringPiece::endsWith() usage to C++20 (#15123)
pedroerp Oct 11, 2025
768be24
feat: Add TIME support to minute function (#15068)
han-yan01 Oct 11, 2025
a8cf88c
opt: Optimize the buffer isView check (#15120)
xiaoxmeng Oct 11, 2025
fcf4943
refactor: Buffer (#15085)
MBkkt Oct 11, 2025
8f5e09f
docs(geo): Fix typo for st_relate function (#15067)
jkhaliqi Oct 13, 2025
757fdb1
fix(dwio): Fix decompression in ZlibDecompressionStream readOrSkip (#…
xin-zhang2 Oct 13, 2025
d0b0934
build(ci): Free more space for docker builds (#15040)
czentgr Oct 13, 2025
b684677
fix: error propagation in element_at UDF (#15110)
Oct 13, 2025
cac6958
fix: Addup expression stats in PlanNodeStats::operator+=() (#15010)
MnO2 Oct 13, 2025
5deb95a
Fix: Unreachable-fallthrough issue in velox (#15146)
r-barnes Oct 13, 2025
fab2612
feat: Implement basic SpatialIndex (#15102)
jagill Oct 13, 2025
8dd6984
fix: Crash in constructing null ConstantVector of type ROW() (#15018)
zhli1142015 Oct 13, 2025
6e19b81
fix(orc): Session property name for `orc_use_column_name` (#14971)
hantangwangd Oct 13, 2025
110c980
fix(CI): Add setup file changes trigger for macOS (#15133)
czentgr Oct 13, 2025
b625dbc
feat: Add page load time metric (#14587)
rui-mo Oct 13, 2025
4e29e7c
Fix HiveDataSink nonReclaimableSection bug with multiple writers (#15…
emilysun201309 Oct 13, 2025
4fec004
refactor: Remove last usages of folly::StringPiece in velox/functions…
pedroerp Oct 13, 2025
945f552
fix(build): Patch required by macOS for setup abseil dependency (#15132)
czentgr Oct 14, 2025
755c76e
feat(geo): Require geometry fields in SpatialJoinNode (#15107)
jagill Oct 14, 2025
749b1df
fix: TypeParser Enum type name to allow special characters (#15125)
HeidiHan0000 Oct 14, 2025
6d010ff
refactor: Remove usage of folly::StringPiece in JsonExtractor (#15137)
pedroerp Oct 14, 2025
3412b77
fix: Merge wasn't completely correct (#15138)
MBkkt Oct 14, 2025
7b45b3e
refactor: Handle row group inside the row iterator factory (#15154)
xiaoxmeng Oct 14, 2025
2d5040c
feat: Upgrade simdjson to 3.13.0 (#14917)
czentgr Oct 14, 2025
10caa03
feat: Add support for timeMinusIntervalDayToSecond and timeMinusInter…
pratikpugalia Oct 14, 2025
2191ecb
feat: Add support for casting VARCHAR to TIME (#15041)
han-yan01 Oct 14, 2025
fac8169
refactor: Add "Locked" suffix to thread-unsafe methods in AsyncDataCa…
lingbin Oct 14, 2025
dd9dcdd
add CudfAssignUniqueId
karthikeyann Sep 8, 2025
e7f248f
add unit tests for CudfAssignUniqueId
karthikeyann Sep 8, 2025
fc8d48b
fix comments
karthikeyann Sep 15, 2025
5c42d03
Apply suggestions from code review
karthikeyann Oct 6, 2025
1bc2cc9
misc(cache): Update SsdRun code comments and initialization (#15031)
lingbin Oct 14, 2025
1793a77
fix: Make flatmap vector initialization more robust (#15152)
Oct 14, 2025
2465663
feat: Prepare documentation for Delta Lake functions (#14861)
zhztheplayer Oct 14, 2025
6c1e8cb
fix: ST_NumPoints failing with wrong return type (#15129)
jkhaliqi Oct 14, 2025
dcd28b2
refactor: Move legacy vector API under ifdef (#14982)
MBkkt Oct 14, 2025
d187cb0
Reduce NUM_THREADS in Fedora workflow to fix OOM (#15166)
abhinavmuk04 Oct 14, 2025
34c4390
fix: Remove the legacy code in row serializer (#15164)
xiaoxmeng Oct 15, 2025
3ac198b
fix: Deprecate the velox connector factory API (#15162)
xiaoxmeng Oct 15, 2025
57244cb
fix: Spark to_json breaking build (#15160)
Yuhta Oct 15, 2025
a9be5cf
Merge branch 'main' into fea-CudfAssignUniqueId
karthikeyann Oct 15, 2025
7a55b4e
feat: Add FlatMapVector support for Subscript (#15038)
peterenescu Oct 15, 2025
a0e2f84
Add remap_keys velox udf (#15069)
abhinavmuk04 Oct 15, 2025
d0f3e0c
refactor(cudf): Fix velox_cudf_hive_connector dependencies (#15087)
majetideepak Oct 15, 2025
6ec5100
Fix: Remove unused exception parameter from velox/type/Type.cpp (#15182)
r-barnes Oct 15, 2025
d4b1ab2
fix: Reduce lock contention in ScanTracker
arhimondr Oct 15, 2025
3b34135
feat: Allow customized serialized page (#15181)
xiaoxmeng Oct 15, 2025
27bae1e
feat(geo): Add leftNode/rightNode methods to SpatialJoinNode (#15171)
jagill Oct 16, 2025
bcff9f9
Add cosco shuffle write replayer (#15174)
tanjialiang Oct 16, 2025
cd4b519
refactor: Move ioStats and fileReadOps to FileStorageContext struct (…
vhsu14 Oct 16, 2025
6ddd3fc
refactor: Remove unused 'cache_load_quantum' flag (#15158)
lingbin Oct 16, 2025
7884da4
fix: Make `BaseVector::flattenVector` unwrap lazy vectors (#14491)
zhztheplayer Oct 16, 2025
e2f5e6f
perf(exchange): Optimize promise insertion to avoid temporary Promise…
lingbin Oct 16, 2025
6f69159
refactor(stats-reporter): Add default method implementation (#15186)
pedroerp Oct 16, 2025
eb2e62f
feat: Add support for second with TIME type (#15058)
han-yan01 Oct 16, 2025
fe607ba
feat: Add support for casting TIME to TIMESTAMP (#15184)
han-yan01 Oct 16, 2025
e9a6f05
feat: Add TIME support to hour function (#15079)
han-yan01 Oct 16, 2025
87721f1
refactor: Remove backward compatibility code from remote functions (#…
pedroerp Oct 16, 2025
8660c1b
feat: Populate user context for file open requests (#15155)
vhsu14 Oct 16, 2025
b20d235
perf: Read spilled Window data in batches to optimize for small windo…
HolyLow Oct 16, 2025
7d8ab80
fix: Fix Prstisstismo stats reporter build failure (#15194)
xiaoxmeng Oct 16, 2025
2b36d60
refactor(python): Avoid copy on ROW type construction (#15193)
pedroerp Oct 16, 2025
3e6cea1
feat: Add Thrift service LocalRunnerService (#14390)
peterenescu Oct 16, 2025
0907889
feat: Add TIME support to millisecond function (#14967)
han-yan01 Oct 17, 2025
0115c0f
Merge branch 'main' into fea-CudfAssignUniqueId
karthikeyann Oct 17, 2025
f9b67ea
feat(dwio): Enable row size tracking for metalake separately (#15111)
Oct 17, 2025
ee28c00
feat: Add TIME support to date_diff function (#15081)
han-yan01 Oct 17, 2025
0181c0e
fix(cudf): Fix initialize operator in driver creation (#14956)
jinchengchenghh Oct 17, 2025
68ce43b
refactor: Move getVectorSerdeOptions() to a common util location (#15…
tanjialiang Oct 17, 2025
b073a73
feat: Add P4HyperLogLog cast from/to HyperLogLog (#14787)
natashasehgal Oct 17, 2025
970097b
Merge branch 'main' into fea-CudfAssignUniqueId
karthikeyann Oct 17, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
12 changes: 6 additions & 6 deletions .github/workflows/benchmark.yml
Original file line number Diff line number Diff line change
Expand Up @@ -47,14 +47,14 @@ jobs:
INSTALL_PREFIX: ${{ github.workspace }}/dependencies
steps:

- uses: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683 # v4.2.2
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
with:
path: velox

persist-credentials: false

- name: Restore Dependencies
uses: actions/cache/restore@0400d5f644dc74513175e3cd8d07132dd4860809 # v4.2.4
uses: actions/cache/restore@0057852bfaa89a56745cba8c7296529d2fc39830 # v4.3.0
id: restore-deps
with:
path: ${{ env.INSTALL_PREFIX }}
Expand Down Expand Up @@ -84,7 +84,7 @@ jobs:

- name: Save Dependencies
if: ${{ steps.restore-deps.outputs.cache-hit != 'true' }}
uses: actions/cache/save@0400d5f644dc74513175e3cd8d07132dd4860809 # v4.2.4
uses: actions/cache/save@0057852bfaa89a56745cba8c7296529d2fc39830 # v4.3.0
with:
path: ${{ env.INSTALL_PREFIX }}
key: dependencies-benchmark-${{ hashFiles('velox/scripts/setup-ubuntu.sh') }}
Expand Down Expand Up @@ -151,12 +151,12 @@ jobs:
merge-multiple: true
path: /tmp/artifacts/

- uses: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683 # v4.2.2
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
with:
path: velox
persist-credentials: false

- uses: actions/setup-python@a26af69be951a213d495a4c3e4e4022e16d87065 # v5.6.0
- uses: actions/setup-python@e797f83bcb11b83ae66e0230d6156d7c80228e7c # v6.0.0
with:
python-version: '3.10'
cache: pip
Expand Down Expand Up @@ -187,7 +187,7 @@ jobs:
run: echo "failed=true" >> $GITHUB_OUTPUT

- name: Create a GitHub Status on the contender commit (whether the upload was successful)
uses: actions/github-script@60a0d83039c74a4aee543508d2ffcb1c3799cdea # v7.0.1
uses: actions/github-script@ed597411d8f924073f98dfc5c65a23a2325f34cd # v8.0.0
if: ${{ !cancelled() && steps.extract.conclusion != 'failure' }}
with:
script: |
Expand Down
8 changes: 4 additions & 4 deletions .github/workflows/breeze.yml
Original file line number Diff line number Diff line change
Expand Up @@ -54,13 +54,13 @@ jobs:
working-directory: velox
steps:

- uses: actions/checkout@v4
- uses: actions/checkout@v5.0.0
with:
path: velox
persist-credentials: false

- name: Install uv
uses: astral-sh/setup-uv@4959332f0f014c5280e7eac8b70c90cb574c9f9b # v6.6.0
uses: astral-sh/setup-uv@d0cc045d04ccac9d8b7881df0226f9e82c39688e # v6.8.0

- name: Install Dependencies
run: |
Expand Down Expand Up @@ -95,13 +95,13 @@ jobs:
working-directory: velox
steps:

- uses: actions/checkout@v4
- uses: actions/checkout@v5.0.0
with:
path: velox
persist-credentials: false

- name: Install uv
uses: astral-sh/setup-uv@4959332f0f014c5280e7eac8b70c90cb574c9f9b # v6.6.0
uses: astral-sh/setup-uv@d0cc045d04ccac9d8b7881df0226f9e82c39688e # v6.8.0

- name: Install Dependencies
run: |
Expand Down
4 changes: 2 additions & 2 deletions .github/workflows/build-metrics.yml
Original file line number Diff line number Diff line change
Expand Up @@ -49,7 +49,7 @@ jobs:
run:
shell: bash
steps:
- uses: actions/checkout@v4
- uses: actions/checkout@v5.0.0
with:
ref: ${{ inputs.ref || github.sha }}
persist-credentials: false
Expand Down Expand Up @@ -145,7 +145,7 @@ jobs:
needs: metrics
steps:
- name: Checkout
uses: actions/checkout@b4ffde65f46336ab88eb53be808477a3936bae11 # v4.1.1
uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
with:
fetch-depth: 0
persist-credentials: true
Expand Down
8 changes: 4 additions & 4 deletions .github/workflows/build_pyvelox.yml
Original file line number Diff line number Diff line change
Expand Up @@ -50,14 +50,14 @@ jobs:
matrix:
os: [8-core-ubuntu, macos-13, macos-14]
steps:
- uses: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683 # v4.2.2
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
with:
ref: ${{ inputs.ref || github.ref }}
fetch-depth: 0

persist-credentials: false

- uses: actions/setup-python@a26af69be951a213d495a4c3e4e4022e16d87065 # v5.6.0
- uses: actions/setup-python@e797f83bcb11b83ae66e0230d6156d7c80228e7c # v6.0.0
with:
python-version: '3.10'

Expand Down Expand Up @@ -162,12 +162,12 @@ jobs:

- run: ls wheelhouse

- uses: actions/setup-python@a26af69be951a213d495a4c3e4e4022e16d87065 # v5.6.0
- uses: actions/setup-python@e797f83bcb11b83ae66e0230d6156d7c80228e7c # v6.0.0
with:
python-version: '3.10'

- name: Publish a Python distribution to PyPI
uses: pypa/gh-action-pypi-publish@76f52bc884231f62b9a034ebfe128415bbaabdfc # v1.12.4
uses: pypa/gh-action-pypi-publish@ed0c53931b1dc9bd32cbe73a98c7f6766f8a527e # v1.13.0
with:
password: ${{ secrets.PYPI_API_TOKEN }}
packages_dir: wheelhouse
25 changes: 18 additions & 7 deletions .github/workflows/docker.yml
Original file line number Diff line number Diff line change
Expand Up @@ -55,16 +55,27 @@ jobs:
steps:
- name: Free Disk Space
run: |
# 15G
sudo rm -rf /usr/local/lib/android || :
# 5.3GB
sudo rm -rf /opt/hostedtoolcache/CodeQL || :
# Re-used from free-disk-space github action.
getAvailableSpace() { echo $(df -a $1 | awk 'NR > 1 {avail+=$4} END {print avail}'); }
# Show before
echo "Original available disk space: " $(getAvailableSpace)
# Remove DotNet.
sudo rm -rf /usr/share/dotnet || true
# Remove android
sudo rm -rf /usr/local/lib/android || true
# Remove CodeQL
sudo rm -rf /opt/hostedtoolcache/CodeQL || true
# Remove Haskell
sudo rm -rf /opt/ghc || true
sudo rm -rf /usr/local/.ghcup || true
# Show after
echo "New available disk space: " $(getAvailableSpace)

- name: Set up Docker Buildx
uses: docker/setup-buildx-action@e468171a9de216ec08956ac3ada2f0791b6bd435 # v3.11.1

- name: Login to GitHub Container Registry
uses: docker/login-action@184bdaa0721073962dff0199f1fb9940f07167d1 # v3.5.0
uses: docker/login-action@5e57cd118135c172c3672efd75eb46360885c0ef # v3.6.0
with:
registry: ghcr.io
username: ${{ github.actor }}
Expand Down Expand Up @@ -130,7 +141,7 @@ jobs:
packages: write
steps:
- name: Login to GitHub Container Registry
uses: docker/login-action@184bdaa0721073962dff0199f1fb9940f07167d1 # v3.5.0
uses: docker/login-action@5e57cd118135c172c3672efd75eb46360885c0ef # v3.6.0
with:
registry: ghcr.io
username: ${{ github.actor }}
Expand Down Expand Up @@ -168,7 +179,7 @@ jobs:
target: [java]
steps:
- name: Login to GitHub Container Registry
uses: docker/login-action@184bdaa0721073962dff0199f1fb9940f07167d1 # v3.5.0
uses: docker/login-action@5e57cd118135c172c3672efd75eb46360885c0ef # v3.6.0
with:
registry: ghcr.io
username: ${{ github.actor }}
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/docs.yml
Original file line number Diff line number Diff line change
Expand Up @@ -51,7 +51,7 @@ jobs:
key: ccache-docs-8-core-ubuntu

- name: Checkout
uses: actions/checkout@b4ffde65f46336ab88eb53be808477a3936bae11 # v4.1.1
uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
with:
fetch-depth: 0
persist-credentials: true
Expand Down
73 changes: 67 additions & 6 deletions .github/workflows/linux-build-base.yml
Original file line number Diff line number Diff line change
Expand Up @@ -36,13 +36,12 @@ jobs:
env:
CCACHE_DIR: ${{ github.workspace }}/ccache
VELOX_DEPENDENCY_SOURCE: SYSTEM
GTest_SOURCE: BUNDLED
cudf_SOURCE: BUNDLED
CUDA_VERSION: '12.8'
faiss_SOURCE: BUNDLED
USE_CLANG: "${{ inputs.use-clang && 'true' || 'false' }}"
steps:
- uses: actions/checkout@v4
- uses: actions/checkout@v5.0.0
with:
fetch-depth: 2
persist-credentials: false
Expand Down Expand Up @@ -109,10 +108,10 @@ jobs:
else
# cuDF (unsupported for Clang) and Faiss (link issue when using Clang)
# are excluded for Clang compilation and need to be added back when using GCC.
EXTRA_CMAKE_FLAGS+="-DVELOX_ENABLE_CUDF=ON"
EXTRA_CMAKE_FLAGS+="-DVELOX_ENABLE_FAISS=ON"
EXTRA_CMAKE_FLAGS+=("-DVELOX_ENABLE_CUDF=ON")
EXTRA_CMAKE_FLAGS+=("-DVELOX_ENABLE_FAISS=ON")
# Investigate issues with remote function service: Issue #13897
EXTRA_CMAKE_FLAGS+="-DVELOX_ENABLE_REMOTE_FUNCTIONS=ON"
EXTRA_CMAKE_FLAGS+=("-DVELOX_ENABLE_REMOTE_FUNCTIONS=ON")
fi
make release EXTRA_CMAKE_FLAGS="${EXTRA_CMAKE_FLAGS[*]}"

Expand All @@ -134,6 +133,8 @@ jobs:
source "/opt/miniforge/etc/profile.d/conda.sh"
conda activate adapters
fi
# Needed for HADOOP 3.3.6 minicluster. Can remove after updating to 3.4.2.
wget https://repo1.maven.org/maven2/org/mockito/mockito-core/2.23.4/mockito-core-2.23.4.jar -O /usr/local/hadoop/share/hadoop/mapreduce/mockito-core-2.23.4.jar

export CLASSPATH=`/usr/local/hadoop/bin/hdfs classpath --glob`
ctest -j 8 --label-exclude cuda_driver --output-on-failure --no-tests=error
Expand Down Expand Up @@ -163,7 +164,7 @@ jobs:
run: |
mkdir -p "$CCACHE_DIR"

- uses: actions/checkout@v4
- uses: actions/checkout@v5.0.0
with:
path: velox
persist-credentials: false
Expand Down Expand Up @@ -206,3 +207,63 @@ jobs:
- name: Run Tests
run: |
cd _build/debug && ctest -j 8 --output-on-failure --no-tests=error

fedora-debug:
runs-on: 8-core-ubuntu-22.04
container: ghcr.io/facebookincubator/velox-dev:fedora
# prevent errors when forks ff their main branch
if: ${{ github.repository == 'facebookincubator/velox' }}
name: Fedora debug
env:
CCACHE_DIR: ${{ github.workspace }}/ccache
defaults:
run:
shell: bash
working-directory: velox
steps:
- name: Get Ccache Stash
uses: apache/infrastructure-actions/stash/restore@3354c1565d4b0e335b78a76aedd82153a9e144d4
with:
path: ${{ env.CCACHE_DIR }}
key: ccache-fedora-debug-default-gcc

- name: Ensure Stash Dirs Exists
working-directory: ${{ github.workspace }}
run: |
mkdir -p "$CCACHE_DIR"

- uses: actions/checkout@v4
with:
path: velox
persist-credentials: false

- name: Clear CCache Statistics
run: |
ccache -sz

- name: Make Debug Build
env:
VELOX_DEPENDENCY_SOURCE: SYSTEM
faiss_SOURCE: BUNDLED
fmt_SOURCE: BUNDLED
simdjson_SOURCE: BUNDLED
gRPC_SOURCE: SYSTEM
MAKEFLAGS: NUM_THREADS=4 MAX_HIGH_MEM_JOBS=4 MAX_LINK_JOBS=3
EXTRA_CMAKE_FLAGS: >-
-DVELOX_ENABLE_PARQUET=ON
-DARROW_THRIFT_USE_SHARED=ON
-DVELOX_ENABLE_EXAMPLES=ON
run: |
uv tool install --force [email protected]
dnf install -y -q --setopt=install_weak_deps=False grpc-devel grpc-plugins
export EXTRA_CMAKE_FLAGS="${EXTRA_CMAKE_FLAGS} -DVELOX_ENABLE_FAISS=ON"
make debug

- name: CCache after
run: |
ccache -vs

- uses: apache/infrastructure-actions/stash/save@3354c1565d4b0e335b78a76aedd82153a9e144d4
with:
path: ${{ env.CCACHE_DIR }}
key: ccache-fedora-debug-default-gcc
8 changes: 7 additions & 1 deletion .github/workflows/macos.yml
Original file line number Diff line number Diff line change
Expand Up @@ -22,6 +22,9 @@ on:
- CMakeLists.txt
- CMake/**
- scripts/setup-macos.sh
- scripts/setup-common.sh
- scripts/setup-versions.sh
- scripts/setup-helper-functions.sh
- .github/workflows/macos.yml

pull_request:
Expand All @@ -31,6 +34,9 @@ on:
- CMakeLists.txt
- CMake/**
- scripts/setup-macos.sh
- scripts/setup-common.sh
- scripts/setup-versions.sh
- scripts/setup-helper-functions.sh
- .github/workflows/macos.yml

permissions:
Expand Down Expand Up @@ -58,7 +64,7 @@ jobs:
INSTALL_PREFIX: /tmp/deps-install
steps:
- name: Checkout
uses: actions/checkout@v4
uses: actions/checkout@v5.0.0
with:
persist-credentials: false

Expand Down
6 changes: 3 additions & 3 deletions .github/workflows/preliminary_checks.yml
Original file line number Diff line number Diff line change
Expand Up @@ -32,10 +32,10 @@ jobs:
pre-commit:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@f43a0e5ff2bd294095638e18286ca9a3d1956744 # v3.6.0
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
with:
persist-credentials: false
- uses: actions/setup-python@3542bca2639a428e1796aaa6a2ffef0c0f575566 # v3.1.4
- uses: actions/setup-python@e797f83bcb11b83ae66e0230d6156d7c80228e7c # v6.0.0
- uses: pre-commit/action@2c7b3805fd2a0fd8c1884dcaebf91fc102a13ecd # v3.0.1

title-check:
Expand All @@ -49,7 +49,7 @@ jobs:
import re
import os
title = os.environ["title"]
title_re = r"^(feat|fix|build|test|docs|refactor|misc)(\(.+\))?!?: ([A-Z].+)[^.]$"
title_re = r"^(feat|fix|perf|build|test|docs|refactor|misc)(\(.+\))?!?: ([A-Z].+)[^.]$"
match = re.search(title_re, title)

if match is None:
Expand Down
Loading