test-to-harness: initial set up #511

I think this is a promising direction, not least because we're already seeing improvements, and there is plenty of room for technical refinement since we currently do little more than copy-paste the test into the prompt. That said, I think the PR is now in a state where we can build on it with incremental improvements.

In particular, I think improvements are needed in (1) the architecture around benchmarks; (2) more context around tests; and (3) more experiments around tests. For example, we currently copy whole files into the prompt, which we could probably refine: where a file contains multiple tests, we could extract the individual tests.
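
As a rough illustration of point (3), here is a minimal sketch of splitting a test file into individual tests. It assumes GoogleTest-style `TEST(Suite, Name) { ... }` cases, which is only one of many test layouts. The function name `extract_tests` is hypothetical and not part of this PR. The brace matching is naive: it ignores braces inside string literals and comments.

```python
import re

# Matches the opening of a GoogleTest-style case, e.g. "TEST(Suite, Name) {".
# Assumption: tests use the TEST / TEST_F / TEST_P macros.
_TEST_HEADER = re.compile(
    r'\bTEST(?:_F|_P)?\s*\(\s*(\w+)\s*,\s*(\w+)\s*\)\s*\{')


def extract_tests(source: str) -> dict[str, str]:
  """Returns a mapping of 'Suite.Name' to the full text of each test case."""
  tests = {}
  for match in _TEST_HEADER.finditer(source):
    depth = 0
    # Start scanning at the opening brace matched by the header pattern.
    start = match.end() - 1
    for pos in range(start, len(source)):
      if source[pos] == '{':
        depth += 1
      elif source[pos] == '}':
        depth -= 1
        if depth == 0:
          # The brace that closes the test body has been reached.
          name = f'{match.group(1)}.{match.group(2)}'
          tests[name] = source[match.start():pos + 1]
          break
  return tests
```

Each extracted test could then be placed in the prompt on its own instead of the whole file, although shared fixtures and includes would still need to be carried along separately.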

data_prep/introspector.py

DonggeLiu · 2024-07-31T10:36:16Z

experiment/benchmark.py

+                cppify_headers=cppify_headers,
+                commit=commit,
+                use_context=use_context,
+                function_dict=function))


OK for now, but would you please merge the code shared by the if/else blocks later to reduce repetition?
Thanks

Yep, will do
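
One possible shape for that follow-up, as a sketch only: collect the arguments common to both branches once and add the branch-specific ones afterwards. The `Benchmark` class, the `build_benchmark` helper and the `test_file_path` field below are hypothetical stand-ins and do not reflect the real code in experiment/benchmark.py. Only the `commit`, `use_context` and `function_dict` argument names come from the diff above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Benchmark:
  """Hypothetical stand-in; not the real class in experiment/benchmark.py."""
  project: str
  commit: Optional[str] = None
  use_context: bool = False
  function_dict: Optional[dict] = None
  test_file_path: str = ''


def build_benchmark(project, commit, use_context, function=None, test_file=''):
  """Builds a benchmark without repeating the shared arguments per branch."""
  # Arguments shared by both branches are collected once.
  shared = dict(project=project, commit=commit, use_context=use_context)
  if function is not None:
    # Branch-specific argument for function-based benchmarks.
    shared['function_dict'] = function
  else:
    # Branch-specific argument for test-file-based benchmarks.
    shared['test_file_path'] = test_file
  return Benchmark(**shared)
```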

llm_toolkit/prompt_builder.py

run_all_experiments.py

run_one_experiment.py

Signed-off-by: David Korczynski <[email protected]>

DavidKorczynski · 2024-08-02T20:50:59Z

/gcbrun exp -n dk-comparisonasdf12 -m vertex_ai_gemini-1-5 -b minor-for-ci

DavidKorczynski · 2024-08-02T20:56:20Z

/gcbrun exp -n dk-comparisonasfdf12 -m vertex_ai_gemini-1-5 -b minor-for-ci

This PR adds JVM project support for the test-to-harness approach initiated in #511. This PR also adds a new benchmark set using the test-to-harness approach on Java projects.

Signed-off-by: Arthur Chan <[email protected]>

DavidKorczynski added 3 commits July 26, 2024 09:10

initial test-to-harness migration set up

67b7a84

Signed-off-by: David Korczynski <[email protected]>

fix style

8df02db

Signed-off-by: David Korczynski <[email protected]>

Merge branch 'main' into test-to-harness-migration-init

3717f05

DavidKorczynski added 5 commits July 27, 2024 03:49

add from-test benchmark set

4b41615

Signed-off-by: David Korczynski <[email protected]>

nit

7314038

Signed-off-by: David Korczynski <[email protected]>

nit

92c0667

Signed-off-by: David Korczynski <[email protected]>

cleanup

1f6d0df

Signed-off-by: David Korczynski <[email protected]>

nit

3ba15a9

Signed-off-by: David Korczynski <[email protected]>

DavidKorczynski added 2 commits July 27, 2024 04:19

nit

5771056

Signed-off-by: David Korczynski <[email protected]>

nit

498a2f8

Signed-off-by: David Korczynski <[email protected]>

Merge branch 'main' into test-to-harness-migration-init

639a15a

DavidKorczynski marked this pull request as ready for review July 27, 2024 13:50

DavidKorczynski requested review from DonggeLiu and oliverchang July 27, 2024 13:54

DonggeLiu approved these changes Jul 31, 2024

address review

1aafd12

Signed-off-by: David Korczynski <[email protected]>

Merge branch 'main' into test-to-harness-migration-init

2f1f544

DavidKorczynski merged commit 5b5ee46 into main Aug 2, 2024
6 checks passed

DavidKorczynski deleted the test-to-harness-migration-init branch August 2, 2024 21:40

arthurscchan mentioned this pull request Sep 9, 2024

JVM: test to harness approach for Java project #592

Merged
