
Conversation

@comphead
Contributor

@comphead comphead commented Oct 22, 2025

Which issue does this PR close?

Related: #2614, #2611.

Rationale for this change

Extract the comparison logic into a separate tool that can be run against already generated Comet and Spark query results.
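
For illustration, a rough sketch of what such a standalone entry point could look like; the object name, argument handling, results layout, and the `compareSingleQuery` helper are all assumptions, not part of this PR:

```scala
import java.io.File

// Rough sketch only. Assumes each engine saved per-query results under a
// common root, e.g. <spark-root>/q1 and <comet-root>/q1.
object CompareGeneratedResults {
  def main(args: Array[String]): Unit = {
    val Array(sparkRoot, cometRoot) = args
    val queries = Option(new File(sparkRoot).listFiles())
      .getOrElse(Array.empty[File])
      .filter(_.isDirectory)
      .map(_.getName)
      .sorted
    queries.foreach { q =>
      println(s"Comparing $q ...")
      // compareSingleQuery(s"$sparkRoot/$q", s"$cometRoot/$q") // hypothetical helper
    }
  }
}
```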

What changes are included in this PR?

How are these changes tested?

@comphead comphead requested a review from andygrove October 22, 2025 22:07
      case (a: Array[_], b: Array[_]) =>
        a.length == b.length && a.zip(b).forall(x => same(x._1, x._2))
-     case (a: WrappedArray[_], b: WrappedArray[_]) =>
+     case (a: mutable.WrappedArray[_], b: mutable.WrappedArray[_]) =>
@comphead
Contributor Author

moved it from #2614
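
For context, a hedged sketch of the recursive value check that the `case` above belongs to; only the `Array`/`mutable.WrappedArray` cases come from the diff, the remaining cases and the floating-point tolerance are assumptions:

```scala
import scala.collection.mutable

// Illustrative recursive value-equality check; the tolerance value is arbitrary.
def same(l: Any, r: Any): Boolean = (l, r) match {
  case (null, null) => true
  case (a: Double, b: Double) if a.isNaN && b.isNaN => true
  case (a: Double, b: Double) => math.abs(a - b) < 1e-6
  case (a: Array[_], b: Array[_]) =>
    a.length == b.length && a.zip(b).forall(x => same(x._1, x._2))
  case (a: mutable.WrappedArray[_], b: mutable.WrappedArray[_]) =>
    a.length == b.length && a.zip(b).forall(x => same(x._1, x._2))
  case (a, b) => a == b
}
```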

@codecov-commenter

codecov-commenter commented Oct 22, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 59.16%. Comparing base (f09f8af) to head (9cee835).
⚠️ Report is 635 commits behind head on main.

Additional details and impacted files
@@             Coverage Diff              @@
##               main    #2632      +/-   ##
============================================
+ Coverage     56.12%   59.16%   +3.03%     
- Complexity      976     1436     +460     
============================================
  Files           119      147      +28     
  Lines         11743    13735    +1992     
  Branches       2251     2356     +105     
============================================
+ Hits           6591     8126    +1535     
- Misses         4012     4386     +374     
- Partials       1140     1223      +83     

☔ View full report in Codecov by Sentry.

@andygrove
Member

I don't think that we should have a combined fuzz-testing-and-tpc-benchmark tool. They serve quite different purposes. I think it would be better to move the DataFrame comparison logic into a shared class somewhere and then update our benchmarking tool to be able to use it.

This probably means that we need to convert our benchmark script from Python to Scala.
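
For illustration, the shared class could look roughly like this; the object and method names are hypothetical and the value-level check is only stubbed:

```scala
import org.apache.spark.sql.DataFrame
import org.apache.spark.sql.functions.col

// Hypothetical shared utility that both the fuzzer and a benchmarking tool
// could call; not the actual API proposed in this PR.
object DataFrameComparer {

  case class ComparisonResult(matches: Boolean, mismatchedRows: Long)

  // Placeholder value check; a real one would tolerate float error, NaN, etc.
  private def same(l: Any, r: Any): Boolean = l == r

  def compare(sparkDf: DataFrame, cometDf: DataFrame): ComparisonResult = {
    // Sort by all columns so the row-by-row comparison is deterministic.
    val left = sparkDf.sort(sparkDf.columns.map(col): _*).collect()
    val right = cometDf.sort(cometDf.columns.map(col): _*).collect()
    if (left.length != right.length) {
      return ComparisonResult(matches = false, mismatchedRows = (left.length - right.length).abs.toLong)
    }
    val mismatches = left.zip(right).count { case (l, r) =>
      !l.toSeq.zip(r.toSeq).forall { case (a, b) => same(a, b) }
    }
    ComparisonResult(matches = mismatches == 0, mismatchedRows = mismatches.toLong)
  }
}
```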

@andygrove
Member

Another option would be to update the existing Python benchmark script to save query results to Parquet, and then implement a command-line tool for comparing the Parquet files produced from the Spark and Comet runs.
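
For illustration, the comparison CLI could be as small as this; the object name and the coarse exact-match check are assumptions:

```scala
import org.apache.spark.sql.SparkSession

// Hypothetical command-line tool comparing the Parquet output of the Spark
// and Comet runs. Usage: CompareParquetResults <spark-results> <comet-results>
object CompareParquetResults {
  def main(args: Array[String]): Unit = {
    val Array(sparkResultsPath, cometResultsPath) = args
    val spark = SparkSession.builder().appName("compare-results").getOrCreate()

    val expected = spark.read.parquet(sparkResultsPath)
    val actual = spark.read.parquet(cometResultsPath)

    // Coarse check: same row count and an empty symmetric difference.
    // A real tool would also tolerate small floating-point differences.
    val onlyInSpark = expected.except(actual).count()
    val onlyInComet = actual.except(expected).count()

    if (expected.count() == actual.count() && onlyInSpark == 0 && onlyInComet == 0) {
      println("Results match")
    } else {
      println(s"Results differ: $onlyInSpark rows only in Spark output, " +
        s"$onlyInComet rows only in Comet output")
    }
    spark.stop()
  }
}
```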

@andygrove
Member

I created #2640 to add a new option to the benchmark script, to write query results to Parquet.
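
For reference, the write side is essentially a one-liner per query; the actual change in #2640 is to the Python benchmark script, so this Scala fragment only illustrates the shape, with assumed names:

```scala
import org.apache.spark.sql.DataFrame

// Illustration only; parameter names and output layout are assumptions.
def saveQueryResult(resultDf: DataFrame, outputDir: String, queryName: String): Unit = {
  resultDf.coalesce(1)
    .write
    .mode("overwrite")
    .parquet(s"$outputDir/$queryName")
}
```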

@comphead
Contributor Author

Right, the Parquet-based option looks better IMO, since we can have a command-line utility similar to the fuzzer and reuse the comparison logic. We still need this PR in some form, as it contains some refactoring that makes the comparison logic reusable.

@comphead comphead marked this pull request as draft October 23, 2025 17:17
@comphead comphead changed the title from "chore: add TPC queries to be run by fuzzer correctness checker" to "chore: extract comparison into separate tool" Oct 23, 2025