
@KtorZ KtorZ commented Oct 16, 2025

These benchmarks are real-world scripts extracted from mainnet, to
which the script context has already been applied.

They were produced by https://github.com/r2rationality/turbocardano.

This commit adds a preliminary setup to run those benchmarks and
measure our performance against real data. Using only data from epochs
519, 520 and 521, we already run into cases where the VM crashes.

The crash could have more than one cause: a bug in the VM
implementation, or a bug in the tool that produced the benchmarks.
Either way, the VM should not crash but fail gracefully (or succeed,
should the script actually be valid).
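
To make the distinction between crashing and failing gracefully concrete, here is a minimal sketch of how a harness could classify each benchmark run. It is an illustration only, not the setup added in this commit: `run_script`, `ScriptError`, `Outcome` and `toy_vm` are all hypothetical names, and the sketch assumes the VM signals a rejected script through a dedicated error type.

```python
from collections import Counter
from enum import Enum


class Outcome(Enum):
    SUCCESS = "success"
    GRACEFUL_FAILURE = "graceful_failure"
    CRASH = "crash"


class ScriptError(Exception):
    """Hypothetical: the error a VM raises when it rejects a script cleanly."""


def classify(run_script, script):
    """Run one benchmark script and classify what happened."""
    try:
        run_script(script)
        return Outcome.SUCCESS
    except ScriptError:
        # Expected failure path: the VM evaluated the script and rejected it.
        return Outcome.GRACEFUL_FAILURE
    except Exception:
        # Anything else escaped the VM's own error handling: count it as a crash.
        return Outcome.CRASH


def summarize(run_script, scripts):
    """Tally outcomes across a collection of benchmark scripts."""
    return Counter(classify(run_script, s) for s in scripts)


def toy_vm(script):
    # Stand-in for a real VM (assumption), used only to exercise the harness.
    if script == "invalid":
        raise ScriptError("script evaluated to an error")
    if script == "buggy":
        raise IndexError("internal VM bug")


summary = summarize(toy_vm, ["valid", "invalid", "buggy", "valid"])
```

With a tally like this, the aim would be for the crash count to reach zero, whichever side the underlying bug turns out to be on.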

Also, I have restricted the benchmarks to V3 only, since the semantics
for V1 and V2 are yet to be implemented.

The goal from here is to get all these benchmarks to pass, and
ultimately to compare with the Haskell & C++ implementations.


cc @yHSJ @jonathanlim222 @sierkov

@KtorZ KtorZ requested a review from a team as a code owner October 16, 2025 16:23