Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
121 commits
Select commit Hold shift + click to select a range
631f9ab
Use `Display` formatting of `DataType`:s in error messages (#17565)
emilk Sep 17, 2025
9351c7a
docs: Move Google Summer of Code 2025 pages to a section (#17504)
alamb Sep 17, 2025
da89395
feat: Add `OR REPLACE` to creating external tables (#17580)
jonathanc-n Sep 17, 2025
e2a5b57
`avg(distinct)` support for decimal types (#17560)
Jefffrey Sep 18, 2025
70f4d19
chore(deps): bump taiki-e/install-action from 2.61.8 to 2.61.9 (#17640)
dependabot[bot] Sep 18, 2025
b293e2c
chore(deps): bump Swatinem/rust-cache from 2.8.0 to 2.8.1 (#17641)
dependabot[bot] Sep 18, 2025
13208e6
Validate the memory consumption in SPM created by multi level merge (…
ding-young Sep 18, 2025
c910db4
fix(SubqueryAlias): use maybe_project_redundant_column (#17478)
notfilippo Sep 18, 2025
f1765a1
minor: Ensure `datafusion-sql` package dependencies have `sql` flag (…
Jefffrey Sep 18, 2025
4528f2f
optimizer: Rewrite `IS NOT DISTINCT FROM` joins as Hash Joins (#17319)
2010YOUY01 Sep 18, 2025
980c948
Upgrade to arrow 56.1.0 (#17275)
alamb Sep 18, 2025
293bf3e
fix: Preserves field metadata when creating logical plan for VALUES e…
Kontinuation Sep 18, 2025
c1ca3c4
chore(deps): bump serde from 1.0.223 to 1.0.225 (#17614)
dependabot[bot] Sep 18, 2025
d587b8d
chore: Update dynamic filter formatting (#17647)
rkrishn7 Sep 19, 2025
5f0b83c
chore(deps): bump taiki-e/install-action from 2.61.9 to 2.61.10 (#17660)
dependabot[bot] Sep 19, 2025
0aa6d35
proto: don't include parquet feature by default (#17577)
jackkleeman Sep 19, 2025
4e94346
feat: add support for RightAnti and RightSemi join types (#17604)
bvolpato Sep 19, 2025
ac42765
minor: Ensure `proto` crate has datetime & unicode expr flags in data…
Jefffrey Sep 19, 2025
7d50c37
chore(deps): bump indexmap from 2.11.3 to 2.11.4 (#17661)
dependabot[bot] Sep 19, 2025
4a5b137
docs: add xorq to list of known users (#17668)
dlovell Sep 19, 2025
35c1cfd
Introduce `TypeSignatureClass::Binary` to allow accepting arbitrarily…
Jefffrey Sep 19, 2025
b9517a1
docs: deduplicate links in `introduction.md` (#17669)
Jefffrey Sep 19, 2025
d572eeb
Add explicit PMC/committers list to governance docs page (#17574)
alamb Sep 19, 2025
7125e97
fix: Ignore governance doc from typos (#17678)
rkrishn7 Sep 19, 2025
44cd972
Support Decimal32/64 types (#17501)
AdamGS Sep 19, 2025
75c7720
minor: Improve hygiene for `datafusion-functions` macros (#17638)
Jefffrey Sep 20, 2025
14656f5
feat(small): Display `NullEquality` in join executor's `EXPLAIN` outp…
2010YOUY01 Sep 20, 2025
03b6789
Custom timestamp format for DuckDB (#17653)
krinart Sep 20, 2025
1488e10
feat(substrait): add time literal support (#17655)
bvolpato Sep 20, 2025
00f5b7d
Support LargeList for array_sort (#17657)
Jefffrey Sep 20, 2025
523eadd
Support FixedSizeList for array_except (#17658)
Jefffrey Sep 20, 2025
d96fbde
fix: null padding for `array_reverse` on `FixedSizeList` (#17673)
chenkovsky Sep 20, 2025
c2d839f
chore: refactor array fn signatures & add more slt tests (#17672)
Jefffrey Sep 20, 2025
d55fb6d
Support FixedSizeList for array_to_string (#17666)
Jefffrey Sep 21, 2025
6082727
fix: correct statistics for `NestedLoopJoinExec` (#17680)
duongcongtoai Sep 21, 2025
602475f
minor: add SQLancer fuzzed SLT case for natural joins (#17683)
Jefffrey Sep 21, 2025
bea1b0a
chore: Upgrade Rust version to 1.90.0 (#17677)
rkrishn7 Sep 21, 2025
58bc749
Support FixedSizeList for array_position (#17659)
Jefffrey Sep 21, 2025
78d5779
chore(deps): bump the proto group with 2 updates (#16806)
dependabot[bot] Sep 21, 2025
1629420
feat(spark): implement Spark `make_interval` function (#17424)
davidlghellin Sep 21, 2025
b63ca3e
chore: Update READMEs of crates to be more consistent (#17691)
Jefffrey Sep 22, 2025
f04efd1
chore: update a bunch of dependencies (#17708)
Jefffrey Sep 22, 2025
68bc0ce
chore: fix wasm-pack installation link in wasmtest README (#17704)
Jefffrey Sep 22, 2025
ba50f52
Support FixedSizeList for array_slice via coercion to List (#17667)
Jefffrey Sep 22, 2025
768d238
docs: Remove disclaimer that `datafusion` 50.0.0 is not released (#17…
nuno-faria Sep 22, 2025
3ad9da9
chore(deps): bump taiki-e/install-action from 2.61.10 to 2.62.1 (#17710)
dependabot[bot] Sep 22, 2025
79fc94a
perf: Improve the performance of WINDOW functions with many partition…
nuno-faria Sep 22, 2025
52690c6
fix: Partial AggregateMode will generate duplicate field names which …
zhuqi-lucas Sep 22, 2025
95357ed
feat: expose `udafs` and `udwfs` methods on `FunctionRegistry` (#17650)
milenkovicm Sep 22, 2025
a8d5016
Support remaining substrait time literal variations (#17707)
petern48 Sep 22, 2025
eeec017
Bump MSRV to 1.87.0 (#17724)
findepi Sep 22, 2025
4ea7601
Avoid redundant Schema clones (#17643)
findepi Sep 22, 2025
677a000
Use github link instead of relative link to optimizer_rule.rs in quer…
petern48 Sep 22, 2025
46a47a9
Move misplaced upgrading entry about MSRV (#17727)
findepi Sep 22, 2025
22a1eab
Introduce `avg_distinct()` and `sum_distinct()` functions to DataFram…
Jefffrey Sep 22, 2025
23d91c5
Support `WHERE`, `ORDER BY`, `LIMIT`, `SELECT`, `EXTEND` pipe operato…
simonvandel Sep 22, 2025
84b327c
doc: add missing examples for multiple math functions (#17018)
Adez017 Sep 23, 2025
11ffa67
feat: support for null, date, and timestamp types in approx_distinct …
killme2008 Sep 23, 2025
9bc8202
fix(agg/corr): return NULL when variance is zero or samples < 2 (#17621)
killme2008 Sep 23, 2025
348d433
chore(deps): bump taiki-e/install-action from 2.62.1 to 2.62.4 (#17739)
dependabot[bot] Sep 23, 2025
9fae60c
chore(deps): bump tempfile from 3.22.0 to 3.23.0 (#17741)
dependabot[bot] Sep 23, 2025
432995c
chore: make `LimitPushPastWindows` public (#17736)
linhr Sep 23, 2025
2ba2f1c
fix: Remove parquet encryption feature from root deps (#17700)
Vyquos Sep 23, 2025
d5bfcf5
fix: Remove datafusion-macros's dependency on datafusion-expr (#17688)
yutannihilation Sep 23, 2025
33fe3ec
chore: remove homebrew publish instructions from release steps (#17735)
Jefffrey Sep 23, 2025
8471f52
minor: create `OptimizerContext` with provided `ConfigOptions` (#17742)
MichaelScofield Sep 23, 2025
693eeaf
Improve documentation for ordered set aggregate functions (#17744)
alamb Sep 23, 2025
0e3d4a7
docs: fix sidebar overlapping table on configuration page on website …
saimahendra282 Sep 24, 2025
ded6c0d
Add support for calling async UDF as aggregation expression (#17620)
simonvandel Sep 24, 2025
5276522
chore(deps): bump taiki-e/install-action from 2.62.4 to 2.62.5 (#17750)
dependabot[bot] Sep 24, 2025
62e6d5e
(fix): Lag function creates unwanted projection (#17630) (#17639)
renato2099 Sep 24, 2025
6ec14e9
Support `LargeList` in `array_has` simplification to `InList` (#17732)
Jefffrey Sep 24, 2025
ea9fb26
chore(deps): bump wasm-bindgen-test from 0.3.51 to 0.3.53 (#17642)
dependabot[bot] Sep 24, 2025
564864b
feat: support `Utf8View` for more args of `regexp_replace` (#17195)
mbutrovich Sep 24, 2025
ac48ba3
feat(spark): implement Spark `map` function `map_from_arrays` (#17456)
SparkApplicationMaster Sep 24, 2025
4114e78
chore(deps): bump object_store from 0.12.3 to 0.12.4 (#17753)
dependabot[bot] Sep 24, 2025
5bbdb7e
Update `arrow` / `parquet` to 56.2.0 (#17631)
alamb Sep 24, 2025
08e75a9
chore(deps): bump taiki-e/install-action from 2.62.5 to 2.62.6 (#17766)
dependabot[bot] Sep 25, 2025
bdcfd43
Keep aggregate udaf schema names unique when missing an order-by (#17…
wiedld Sep 25, 2025
78786d6
feat : Display function alias in output column name (#17690)
devampatel03 Sep 25, 2025
7f70ac6
Support join cardinality estimation less conservatively (#17476)
jackkleeman Sep 25, 2025
b6f2b08
chore(deps): bump libc from 0.2.175 to 0.2.176 (#17767)
dependabot[bot] Sep 25, 2025
05426bc
chore(deps): bump postgres-types from 0.2.9 to 0.2.10 (#17768)
dependabot[bot] Sep 25, 2025
bb3e0d8
Use `Expr::qualified_name()` and `Column::new()` to extract partition…
masonh22 Sep 25, 2025
c1d6f34
Prevent exponential planning time for Window functions - v2 (#17684)
berkaysynnada Sep 25, 2025
350c5da
docs: add Ballista link to landing page (#17746) (#17775)
Nihallllll Sep 26, 2025
be1b8b6
chore(deps): bump taiki-e/install-action from 2.62.6 to 2.62.8 (#17781)
dependabot[bot] Sep 26, 2025
6c81ca0
chore(deps): bump wasm-bindgen-test from 0.3.53 to 0.3.54 (#17784)
dependabot[bot] Sep 26, 2025
0fefbb1
chore: Action some old TODOs in github actions (#17694)
Jefffrey Sep 26, 2025
691dd47
dev: Add benchmark for compilation profiles (#17754)
2010YOUY01 Sep 26, 2025
9a5024d
chore(deps): bump tokio-postgres from 0.7.13 to 0.7.14 (#17785)
dependabot[bot] Sep 26, 2025
a548cce
chore(deps): bump serde from 1.0.226 to 1.0.227 (#17783)
dependabot[bot] Sep 26, 2025
325e351
chore(deps): bump regex from 1.11.2 to 1.11.3 (#17782)
dependabot[bot] Sep 26, 2025
e566e97
Support `CAST` from temporal to `Utf8View` (#17535)
findepi Sep 26, 2025
783c45b
Add case expr simplifiers for literal comparisons (#17743)
jackkleeman Sep 26, 2025
6e15994
chore: dependabot to run weekly (#17797)
comphead Sep 26, 2025
1a6c79b
[DOCS] Add dbt Fusion engine and R2 Query Engine to "Known Users" (#1…
dataders Sep 26, 2025
22309f9
feat: change `datafusion-proto` to use `TaskContext` rather than`Sess…
milenkovicm Sep 26, 2025
1607561
display window function's alias name in output (#17788)
devampatel03 Sep 26, 2025
5097f2c
docs: update wasmtest README with instructions for Apple silicon (#17…
Jefffrey Sep 27, 2025
82790c5
chore(deps): bump sysinfo from 0.37.0 to 0.37.1 (#17800)
dependabot[bot] Sep 27, 2025
deba006
chore(deps): bump taiki-e/install-action from 2.62.8 to 2.62.9 (#17799)
dependabot[bot] Sep 27, 2025
8c8e565
feat(spark): implement Spark `make_dt_interval` function (#17728)
davidlghellin Sep 27, 2025
bbb5cc7
Fix potential overflow when we print verbose physical plan (#17798)
zhuqi-lucas Sep 27, 2025
2f54f30
Add SedonaDB as known user to Apache DataFusion (#17806)
petern48 Sep 27, 2025
320dd91
Extend datatype semantic equality check to include timestamps (#17777)
shivbhatia10 Sep 28, 2025
d05dcc3
fix: Filter out nulls properly in approx_percentile_cont_with_weight …
Jefffrey Sep 28, 2025
a8925f3
chore: refactor usage of `reassign_predicate_columns` (#17703)
rkrishn7 Sep 28, 2025
2987e80
dev: Add Apache license check to the lint script (#17787)
2010YOUY01 Sep 28, 2025
948f6b8
Fix: common_sub_expression_eliminate optimizer rule failed (#16066)
Col-Waltz Sep 28, 2025
a1d15cc
feat: support multi-threaded writing of Parquet files with modular en…
rok Sep 28, 2025
2d947b3
feat(spark): implement Spark `map` function `map_from_entries` (#17779)
SparkApplicationMaster Sep 29, 2025
a451920
chore: remove dialect settings in SLT tests that are outdated (#17807)
Jefffrey Sep 29, 2025
cc157b8
feat: Add Hash Join benchmarks (#17636)
jonathanc-n Sep 29, 2025
43721dd
chore(deps): bump thiserror from 2.0.16 to 2.0.17 (#17821)
dependabot[bot] Sep 29, 2025
89beee2
chore(deps): bump quote from 1.0.40 to 1.0.41 (#17822)
dependabot[bot] Sep 29, 2025
247450d
chore(deps): bump taiki-e/install-action from 2.62.9 to 2.62.12 (#17823)
dependabot[bot] Sep 29, 2025
63ca0e1
chore(deps): bump serde from 1.0.227 to 1.0.228 (#17827)
dependabot[bot] Sep 29, 2025
5cc0be5
Disable failing benchmark query (#17809)
alamb Sep 29, 2025
7d6d553
chore(deps): bump taiki-e/install-action from 2.62.12 to 2.62.13 (#17…
dependabot[bot] Sep 30, 2025
9c8d3f1
Merge remote-tracking branch 'sandbox/main' into sandbox-main
blaginin Sep 30, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
47 changes: 0 additions & 47 deletions .github/actions/setup-macos-builder/action.yaml

This file was deleted.

9 changes: 0 additions & 9 deletions .github/actions/setup-rust-runtime/action.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -20,10 +20,6 @@ description: 'Setup Rust Runtime Environment'
runs:
using: "composite"
steps:
# https://github.com/apache/datafusion/issues/15535
# disabled because neither version nor git hash works with apache github policy
#- name: Run sccache-cache
# uses: mozilla-actions/sccache-action@65101d47ea8028ed0c98a1cdea8dd9182e9b5133 # v0.0.8
- name: Configure runtime env
shell: bash
# do not produce debug symbols to keep memory usage down
Expand All @@ -32,11 +28,6 @@ runs:
#
# Set debuginfo=line-tables-only as debuginfo=0 causes immensely slow build
# See for more details: https://github.com/rust-lang/rust/issues/119560
#
# readd the following to the run below once sccache-cache is re-enabled
# echo "RUSTC_WRAPPER=sccache" >> $GITHUB_ENV
# echo "SCCACHE_GHA_ENABLED=true" >> $GITHUB_ENV
run: |
echo "RUST_BACKTRACE=1" >> $GITHUB_ENV
echo "RUSTFLAGS=-C debuginfo=line-tables-only -C incremental=false" >> $GITHUB_ENV

2 changes: 1 addition & 1 deletion .github/dependabot.yml
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,7 @@ updates:
- package-ecosystem: cargo
directory: "/"
schedule:
interval: daily
interval: weekly
target-branch: main
labels: [auto-dependencies]
ignore:
Expand Down
6 changes: 2 additions & 4 deletions .github/workflows/audit.yml
Original file line number Diff line number Diff line change
Expand Up @@ -42,10 +42,8 @@ jobs:
steps:
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
- name: Install cargo-audit
uses: taiki-e/install-action@2fdc5fd6ac805b0f8256893bd4c807bcb666af00 # v2
uses: taiki-e/install-action@d0f4f69b07c0804d1003ca9a5a5f853423872ed9 # v2
with:
tool: cargo-audit
- name: Run audit check
# Ignored until https://github.com/apache/datafusion/issues/15571
# ignored py03 warning until arrow 55 upgrade
run: cargo audit --ignore RUSTSEC-2024-0370 --ignore RUSTSEC-2025-0020
run: cargo audit
5 changes: 4 additions & 1 deletion .github/workflows/dev.yml
Original file line number Diff line number Diff line change
Expand Up @@ -33,7 +33,10 @@ jobs:
name: Check License Header
steps:
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
- uses: korandoru/hawkeye@c3d9887d4b48e5b6c77306934606a4ff4623a2d3 # v6.2.0
- name: Install HawkEye
run: cargo install hawkeye --version 6.2.0 --locked --profile dev
- name: Run license header check
run: ci/scripts/license_header.sh

prettier:
name: Use prettier to check formatting of documents
Expand Down
47 changes: 17 additions & 30 deletions .github/workflows/rust.yml
Original file line number Diff line number Diff line change
Expand Up @@ -55,7 +55,7 @@ jobs:
with:
rust-version: stable
- name: Rust Dependency Cache
uses: Swatinem/rust-cache@98c8021b550208e191a6a3145459bfc9fb29c4c0 # v2.8.0
uses: Swatinem/rust-cache@f13886b937689c021905a6b90929199931d60db1 # v2.8.1
with:
shared-key: "amd-ci-check" # this job uses it's own cache becase check has a separate cache and we need it to be fast as it blocks other jobs
save-if: ${{ github.ref_name == 'main' }}
Expand Down Expand Up @@ -108,7 +108,7 @@ jobs:
with:
rust-version: stable
- name: Rust Dependency Cache
uses: Swatinem/rust-cache@98c8021b550208e191a6a3145459bfc9fb29c4c0 # v2.8.0
uses: Swatinem/rust-cache@f13886b937689c021905a6b90929199931d60db1 # v2.8.1
with:
save-if: false # set in linux-test
shared-key: "amd-ci"
Expand Down Expand Up @@ -176,7 +176,7 @@ jobs:
with:
rust-version: stable
- name: Rust Dependency Cache
uses: Swatinem/rust-cache@98c8021b550208e191a6a3145459bfc9fb29c4c0 # v2.8.0
uses: Swatinem/rust-cache@f13886b937689c021905a6b90929199931d60db1 # v2.8.1
with:
save-if: false # set in linux-test
shared-key: "amd-ci"
Expand Down Expand Up @@ -281,7 +281,7 @@ jobs:
with:
rust-version: stable
- name: Rust Dependency Cache
uses: Swatinem/rust-cache@98c8021b550208e191a6a3145459bfc9fb29c4c0 # v2.8.0
uses: Swatinem/rust-cache@f13886b937689c021905a6b90929199931d60db1 # v2.8.1
with:
save-if: ${{ github.ref_name == 'main' }}
shared-key: "amd-ci"
Expand All @@ -308,17 +308,18 @@ jobs:
name: cargo test datafusion-cli (amd64)
needs: linux-build-lib
runs-on: ubuntu-latest
# should be uncommented once https://github.com/apache/datafusion/pull/16644 is merged
# and cache should be added
# container:
# image: amd64/rust
steps:
- uses: actions/checkout@08c6903cd8c0fde910a37f88322edcfb5dd907a8 # v5.0.0
with:
submodules: true
fetch-depth: 1
- name: Setup Rust toolchain
run: rustup toolchain install stable
- name: Rust Dependency Cache
uses: Swatinem/rust-cache@f13886b937689c021905a6b90929199931d60db1 # v2.8.1
with:
save-if: false # set in linux-test
shared-key: "amd-ci"
- name: Run tests (excluding doctests)
env:
RUST_BACKTRACE: 1
Expand Down Expand Up @@ -348,7 +349,7 @@ jobs:
with:
rust-version: stable
- name: Rust Dependency Cache
uses: Swatinem/rust-cache@98c8021b550208e191a6a3145459bfc9fb29c4c0 # v2.8.0
uses: Swatinem/rust-cache@f13886b937689c021905a6b90929199931d60db1 # v2.8.1
with:
save-if: ${{ github.ref_name == 'main' }}
shared-key: "amd-ci-linux-test-example"
Expand Down Expand Up @@ -411,14 +412,16 @@ jobs:
sudo apt-get update -qq
sudo apt-get install -y -qq clang
- name: Setup wasm-pack
uses: taiki-e/install-action@2fdc5fd6ac805b0f8256893bd4c807bcb666af00 # v2
uses: taiki-e/install-action@d0f4f69b07c0804d1003ca9a5a5f853423872ed9 # v2
with:
tool: wasm-pack
- name: Run tests with headless mode
working-directory: ./datafusion/wasmtest
run: |
RUSTFLAGS='--cfg getrandom_backend="wasm_js"' wasm-pack test --headless --firefox
RUSTFLAGS='--cfg getrandom_backend="wasm_js"' wasm-pack test --headless --chrome --chromedriver $CHROMEWEBDRIVER/chromedriver
# debuginfo=none because CI tests weren't completing successfully after this upstream PR:
# https://github.com/wasm-bindgen/wasm-bindgen/pull/4635
RUSTFLAGS='--cfg getrandom_backend="wasm_js" -C debuginfo=none' wasm-pack test --headless --firefox
RUSTFLAGS='--cfg getrandom_backend="wasm_js" -C debuginfo=none' wasm-pack test --headless --chrome --chromedriver $CHROMEWEBDRIVER/chromedriver

# verify that the benchmark queries return the correct results
verify-benchmark-results:
Expand Down Expand Up @@ -533,22 +536,6 @@ jobs:
# export PATH=$PATH:$HOME/d/protoc/bin
# cargo test --lib --tests --bins --features avro,json,backtrace

# Commenting out intel mac build as so few users would ever use it
# Details: https://github.com/apache/datafusion/issues/13846
# macos:
# name: cargo test (macos)
# runs-on: macos-latest
# steps:
# - uses: actions/checkout@v4
# with:
# submodules: true
# fetch-depth: 1
# - name: Setup Rust toolchain
# uses: ./.github/actions/setup-macos-builder
# - name: Run tests (excluding doctests)
# shell: bash
# run: cargo test run --profile ci --exclude datafusion-examples --exclude datafusion-benchmarks --workspace --lib --tests --bins --features avro,json,backtrace

macos-aarch64:
name: cargo test (macos-aarch64)
runs-on: macos-14
Expand Down Expand Up @@ -680,7 +667,7 @@ jobs:
- name: Install Clippy
run: rustup component add clippy
- name: Rust Dependency Cache
uses: Swatinem/rust-cache@98c8021b550208e191a6a3145459bfc9fb29c4c0 # v2.8.0
uses: Swatinem/rust-cache@f13886b937689c021905a6b90929199931d60db1 # v2.8.1
with:
save-if: ${{ github.ref_name == 'main' }}
shared-key: "amd-ci-clippy"
Expand Down Expand Up @@ -752,7 +739,7 @@ jobs:
- name: Setup Rust toolchain
uses: ./.github/actions/setup-builder
- name: Install cargo-msrv
uses: taiki-e/install-action@2fdc5fd6ac805b0f8256893bd4c807bcb666af00 # v2
uses: taiki-e/install-action@d0f4f69b07c0804d1003ca9a5a5f853423872ed9 # v2
with:
tool: cargo-msrv

Expand Down
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -31,6 +31,7 @@ docker_cache
*.orig
.*.swp
.*.swo
*.pending-snap

venv/*

Expand Down
Loading
Loading