Agent improvements: Adopt system instructions and allow multiple command executions #717

DonggeLiu · 2024-11-13T03:40:31Z

Allow passing system instructions to LLM
Allow executing multiple bash commands in one response
Prompt fixes
Minor corrections and bug fixes
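
To illustrate what "multiple bash commands in one response" can look like, here is a minimal sketch. It is not the code in this PR: the `<bash>` tag format and the helper names `extract_commands`, `clean_output`, and `run_commands` are assumptions made for the example.

```python
import re
import subprocess

# Hypothetical tag format; the PR description does not show the real one.
BASH_BLOCK = re.compile(r"<bash>(.*?)</bash>", re.DOTALL)


def extract_commands(response: str) -> list[str]:
    """Return every non-empty command found in the response, in order."""
    commands = (match.strip() for match in BASH_BLOCK.findall(response))
    return [command for command in commands if command]


def clean_output(output: str) -> str:
    """Drop empty lines and surrounding spaces from command output."""
    lines = (line.strip() for line in output.splitlines())
    return "\n".join(line for line in lines if line)


def run_commands(commands: list[str]) -> list[str]:
    """Run each command in its own shell and collect the cleaned output."""
    results = []
    for command in commands:
        completed = subprocess.run(
            command, shell=True, capture_output=True, text=True, check=False
        )
        results.append(clean_output(completed.stdout + completed.stderr))
    return results
```

Each command runs in a separate shell here, so state such as `cd` does not carry over between commands; a real agent may prefer a persistent session.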

DonggeLiu · 2024-11-13T03:43:40Z

In addition to the new features, this also generated buildable fuzz targets for project xs in local experiments for the first time (IIRC):

2024-11-13 14:33:05 [Trial ID: 01] INFO [logger.info]: ===== ROUND 10 Recompile =====
2024-11-13 14:33:11 [Trial ID: 01] DEBUG [logger.debug]: ROUND 10 compilation time: 0:00:06.169302
2024-11-13 14:33:11 [Trial ID: 01] DEBUG [logger.debug]: ROUND 10 Fuzz target compiles: True
2024-11-13 14:33:12 [Trial ID: 01] DEBUG [logger.debug]: ROUND 10 Final fuzz target binary exists: True
2024-11-13 14:33:13 [Trial ID: 01] DEBUG [logger.debug]: ROUND 10 Final fuzz target function referenced: True

Past:

DonggeLiu · 2024-11-13T03:49:13Z

/gcbrun exp -n dg -ag

DonggeLiu · 2024-11-13T11:10:35Z

Report: https://llm-exp.oss-fuzz.com/Result-reports/ofg-pr/2024-11-13-717-dg-comparison/index.html

Seeing many errors like:

  File "/usr/local/lib/python3.11/dist-packages/google/api_core/grpc_helpers.py", line 76, in error_remapped_callable
    return callable_(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/dist-packages/grpc/_channel.py", line 1181, in __call__
    return _end_unary_response_blocking(state, call, False, None)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/dist-packages/grpc/_channel.py", line 1006, in _end_unary_response_blocking
    raise _InactiveRpcError(state)  # pytype: disable=not-instantiable
    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
grpc._channel._InactiveRpcError: <_InactiveRpcError of RPC that terminated with:
status = StatusCode.INVALID_ARGUMENT
details = "Unable to submit request because the input token count is 35103 but model only supports up to 32768. Reduce the input token count and try again. You can also use the CountTokens API to calculate prompt token count and billable characters. Learn more: https://cloud.google.com/vertex-ai/generative-ai/docs/learn/models"
debug_error_string = "UNKNOWN:Error received from peer ipv4:142.250.72.170:443 {grpc_message:"Unable to submit request because the input token count is 35103 but model only supports up to 32768. Reduce the input token count and try again. You can also use the CountTokens API to calculate prompt token count and billable characters. Learn more: https://cloud.google.com/vertex-ai/generative-ai/docs/learn/models", grpc_status:3, created_time:"2024-11-13T04:04:17.961683542+00:00"}"

This is likely due to the newly added system instructions, which count toward the input token limit. I will lower the input size limit accordingly.
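
The failing request above was 35103 - 32768 = 2335 tokens over the limit. A minimal sketch of the adjustment, assuming the system instruction's tokens are simply subtracted from the model limit: `count_tokens` is a crude whitespace-based stand-in for a real tokenizer, and all function names are hypothetical rather than taken from this PR.

```python
MODEL_INPUT_LIMIT = 32768  # taken from the error message above


def count_tokens(text: str) -> int:
    """Crude stand-in: one token per whitespace-separated word (assumption)."""
    return len(text.split())


def prompt_token_budget(system_instruction: str,
                        limit: int = MODEL_INPUT_LIMIT) -> int:
    """Tokens left for the prompt after reserving the system instruction."""
    return max(0, limit - count_tokens(system_instruction))


def fits(system_instruction: str, prompt: str,
         limit: int = MODEL_INPUT_LIMIT) -> bool:
    """True if the prompt stays within the reduced budget."""
    return count_tokens(prompt) <= prompt_token_budget(system_instruction, limit)
```

In practice the count would come from the model's own tokenizer or the CountTokens API mentioned in the error, not from splitting on whitespace.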

The good news is that we finally got a non-zero build rate on both benchmarks from xs:

DonggeLiu · 2024-11-13T11:55:00Z

/gcbrun exp -n dg1 -ag

DonggeLiu · 2024-11-14T05:45:07Z

/gcbrun exp -n dg -ag

DonggeLiu · 2024-11-14T06:33:55Z

/gcbrun exp -n dg -ag

DonggeLiu · 2024-11-14T10:11:23Z

/gcbrun exp -n dg -ag

DonggeLiu · 2024-11-14T11:32:42Z

/gcbrun exp -n dg -ag

DonggeLiu added 13 commits November 12, 2024 18:22

New system instructions

4458e38

Apply system instructions

a2a4922

Refine priming

8a08257

Minor correction

1debe6d

Bug fix

c6f8a55

Do not emphasize on simple/minimum fuzz target

e18071d

Complete conclusion protocol

1b71214

Minimize priming

c0537b1

Visually separate RESPONSE/PROMPT and their content by a line break

30b3cae

Strip empty lines and spaces from bash output

93fa4a6

Allow executing multiple bash commands in one response

d5c81ad

Allow passing system instructions to LLM

ca34bff

More concise objective and instructions

979c371

DonggeLiu added 3 commits November 13, 2024 22:52

Make code consistent

350209c

lower input token limit by system instruction token size

36a2270

Simplify system instruction

8fb6842

DonggeLiu added 4 commits November 14, 2024 16:38

Consider previous text in the same prompt when truncate new text

e90066e

minor fix

0196fe6

ASK LLM do not compile

0416570

Prioritize understanding over retrying

90f388f

DonggeLiu added 2 commits November 14, 2024 17:31

Remove the compile command so that LLM cannot learn

0412011

Debug truncating prompt

b31c7d7

DonggeLiu added 2 commits November 14, 2024 21:06

Reduce unnecessary logs

aa73d58

Fix bug to remove compile command from build result

ed66cb3

DonggeLiu added 3 commits November 14, 2024 21:07

Simpler debugging

5e8a629

Fix bug in truncation

e6d5042

Set log level

fe3e9ab

Fix truncation and be more strict on individual output size limit

60f3db1
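
The truncation commits above ("Consider previous text in the same prompt when truncate new text", "Fix truncation and be more strict on individual output size limit") can be pictured with a sketch like the one below. It is an illustration only: the whitespace token count, the keep-head-and-tail strategy, the marker string, and the name `truncate_to_fit` are all assumptions, not the implementation in this PR.

```python
def count_tokens(text: str) -> int:
    """Crude stand-in: one token per whitespace-separated word (assumption)."""
    return len(text.split())


def truncate_to_fit(previous_text: str, new_text: str, limit: int,
                    marker: str = "...(truncated)...") -> str:
    """Shorten new_text so previous_text plus the result stays within limit.

    The head and tail of new_text are kept and the middle is replaced by
    a marker, on the assumption that both ends of a log matter most.
    """
    remaining = limit - count_tokens(previous_text)
    if remaining <= 0:
        return ""  # the prompt is already full
    words = new_text.split()
    if len(words) <= remaining:
        return new_text  # nothing to cut
    keep = remaining - count_tokens(marker)
    if keep <= 0:
        return " ".join(words[:remaining])  # no room for the marker
    head = keep // 2
    tail = keep - head  # always at least 1 because keep >= 1
    return " ".join(words[:head] + [marker] + words[-tail:])
```

For example, with three tokens already in the prompt and a limit of eight, a ten-word output is reduced to its first two words, the marker, and its last two words.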
