Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

506 refactor value handling #511

Merged
merged 15 commits into from
Mar 27, 2025
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion kmir/pyproject.toml
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@ build-backend = "poetry.core.masonry.api"

[tool.poetry]
name = "kmir"
version = "0.3.105"
version = "0.3.106"
description = ""
authors = [
"Runtime Verification, Inc. <[email protected]>",
Expand Down
2 changes: 1 addition & 1 deletion kmir/src/kmir/__init__.py
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
from typing import Final

VERSION: Final = '0.3.105'
VERSION: Final = '0.3.106'
94 changes: 60 additions & 34 deletions kmir/src/kmir/kdist/mir-semantics/kmir.md
Original file line number Diff line number Diff line change
Expand Up @@ -259,7 +259,7 @@ be known to populate the `currentFunc` field.

rule #reserveFor(localDecl(TY, _, MUT) REST:LocalDecls)
=>
ListItem(noValue(TY, MUT)) #reserveFor(REST)
ListItem(newLocal(TY, MUT)) #reserveFor(REST)
```

Executing a function body consists of repeated calls to `#execBlock`
Expand Down Expand Up @@ -312,19 +312,12 @@ will effectively be no-ops at this level).
// all memory accesses relegated to another module (to be added)
rule <k> #execStmt(statement(statementKindAssign(PLACE, RVAL), _SPAN))
=>
RVAL ~> #assign(PLACE)
#setLocalValue(PLACE, RVAL)
...
</k>

// RVAL evaluation is implemented in rt/data.md

syntax KItem ::= #assign ( Place )

rule <k> VAL:TypedLocal ~> #assign(PLACE) ~> CONT
=>
VAL ~> #setLocalValue(PLACE) ~> CONT
</k>

rule <k> #execStmt(statement(statementKindSetDiscriminant(_PLACE, _VARIDX), _SPAN))
=>
.K // FIXME! write variant index to PLACE
Expand Down Expand Up @@ -359,42 +352,41 @@ function call, pushing a new stack frame and returning to a different
block after the call returns.

```k
rule <k> #execTerminator(terminator(terminatorKindGoto(I), _SPAN))
rule <k> #execTerminator(terminator(terminatorKindGoto(I), _SPAN)) ~> _CONT
=>
#execBlockIdx(I)
... // FIXME: We assume this is empty. Explicitly throw away or check that it is?
</k>
```

A `SwitchInt` terminator selects one of the blocks given as _targets_,
depending on the value of a _discriminant_.

```k
syntax KItem ::= #selectBlock ( SwitchTargets , Evaluation ) [strict(2)]

rule <k> #execTerminator(terminator(terminatorKindSwitchInt(DISCR, TARGETS), _SPAN)) ~> _CONT
=>
#readOperand(DISCR) ~> #selectBlock(TARGETS)
#selectBlock(TARGETS, DISCR)
</k>

rule <k> typedLocal(Integer(I, _, _), _, _) ~> #selectBlock(TARGETS)
rule <k> #selectBlock(TARGETS, typedValue(Integer(I, _, _), _, _))
=>
#execBlockIdx(#selectBlock(I, TARGETS))
...
</k>

rule <k> typedLocal(BoolVal(false), _, _) ~> #selectBlock(TARGETS)
rule <k> #selectBlock(TARGETS, typedValue(BoolVal(false), _, _))
=>
#execBlockIdx(#selectBlock(0, TARGETS))
...
</k>

rule <k> typedLocal(BoolVal(true), _, _) ~> #selectBlock(TARGETS)
rule <k> #selectBlock(TARGETS, typedValue(BoolVal(true), _, _))
=>
#execBlockIdx(#selectBlock(1, TARGETS))
...
</k>

syntax KItem ::= #selectBlock ( SwitchTargets )

syntax BasicBlockIdx ::= #selectBlock ( Int , SwitchTargets) [function, total]
| #selectBlockAux ( Int, Branches, BasicBlockIdx ) [function, total]

Expand Down Expand Up @@ -423,10 +415,12 @@ context of the enclosing stack frame, at the _target_.
If the returned value is a `Reference`, its stack height must be decremented because a stack frame is popped.
NB that a stack height of `0` cannot occur here, because the compiler prevents local variable references from escaping.

If the loval `_0` does not have a value (i.e., it remained uninitialised), the function returns unit and writing the value is skipped.

```k
rule <k> #execTerminator(terminator(terminatorKindReturn, _SPAN)) ~> _
=>
#decrementRef(LOCAL0) ~> #setLocalValue(DEST) ~> #execBlockIdx(TARGET) ~> .K
#setLocalValue(DEST, #decrementRef(LOCAL0)) ~> #execBlockIdx(TARGET)
</k>
<currentFunc> _ => CALLER </currentFunc>
//<currentFrame>
Expand All @@ -435,14 +429,33 @@ NB that a stack height of `0` cannot occur here, because the compiler prevents l
<dest> DEST => NEWDEST </dest>
<target> someBasicBlockIdx(TARGET) => NEWTARGET </target>
<unwind> _ => UNWIND </unwind>
<locals> ListItem(LOCAL0:TypedLocal) _ => NEWLOCALS </locals>
<locals> ListItem(LOCAL0:TypedValue) _ => NEWLOCALS </locals>
//</currentFrame>
// remaining call stack (without top frame)
<stack> ListItem(StackFrame(NEWCALLER, NEWDEST, NEWTARGET, UNWIND, NEWLOCALS)) STACK => STACK </stack>
<functions> FUNCS </functions>
requires CALLER in_keys(FUNCS)
// andBool DEST #within(LOCALS)
[preserves-definedness] // CALLER lookup defined, DEST within locals TODO
[preserves-definedness] // CALLER lookup defined

// no value to return, skip writing
rule <k> #execTerminator(terminator(terminatorKindReturn, _SPAN)) ~> _
=>
#execBlockIdx(TARGET)
</k>
<currentFunc> _ => CALLER </currentFunc>
//<currentFrame>
<currentBody> _ => #getBlocks(FUNCS, CALLER) </currentBody>
<caller> CALLER => NEWCALLER </caller>
<dest> _ => NEWDEST </dest>
<target> someBasicBlockIdx(TARGET) => NEWTARGET </target>
<unwind> _ => UNWIND </unwind>
<locals> ListItem(_:NewLocal) _ => NEWLOCALS </locals>
//</currentFrame>
// remaining call stack (without top frame)
<stack> ListItem(StackFrame(NEWCALLER, NEWDEST, NEWTARGET, UNWIND, NEWLOCALS)) STACK => STACK </stack>
<functions> FUNCS </functions>
requires CALLER in_keys(FUNCS)
[preserves-definedness] // CALLER lookup defined
Comment on lines +440 to +458
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am trying to think of what this looks like in the mir code. What is an example of this?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The new rule is for the case where local _0 has not been written to (it is still sort NewLocal ) but the function returns. That happens regularly when a function returns nothing (i.e., unit). We used to "write" the noValue as a NoOp but now the sort is different so we should skip it.


syntax List ::= #getBlocks(Map, Ty) [function]
| #getBlocksAux(MonoItemKind) [function, total]
Expand Down Expand Up @@ -474,7 +487,7 @@ The call stack is not necessarily empty at this point so it is left untouched.
<retVal> _ => return(VAL) </retVal>
<currentFrame>
<target> noBasicBlockIdx </target>
<locals> ListItem(typedLocal(VAL, _, _)) ... </locals>
<locals> ListItem(typedValue(VAL, _, _)) ... </locals>
...
</currentFrame>

Expand All @@ -485,7 +498,7 @@ The call stack is not necessarily empty at this point so it is left untouched.
</k>
<currentFrame>
<target> noBasicBlockIdx </target>
<locals> ListItem(noValue(_, _)) ... </locals>
<locals> ListItem(newLocal(_, _)) ... </locals>
...
</currentFrame>
```
Expand All @@ -496,10 +509,9 @@ where the returned result should go.


```k
rule <k> #execTerminator(terminator(terminatorKindCall(FUNC, ARGS, DEST, TARGET, UNWIND), _SPAN))
rule <k> #execTerminator(terminator(terminatorKindCall(FUNC, ARGS, DEST, TARGET, UNWIND), _SPAN)) ~> _
=>
#setUpCalleeData({FUNCTIONS[#tyOfCall(FUNC)]}:>MonoItemKind, ARGS)
...
</k>
<currentFunc> CALLER => #tyOfCall(FUNC) </currentFunc>
<currentFrame>
Expand Down Expand Up @@ -566,13 +578,13 @@ An operand may be a `Reference` (the only way a function could access another fu

rule <k> #setArgFromStack(IDX, operandConstant(_) #as CONSTOPERAND)
=>
#readOperand(CONSTOPERAND) ~> #setLocalValue(place(local(IDX), .ProjectionElems))
#setLocalValue(place(local(IDX), .ProjectionElems), CONSTOPERAND)
...
</k>

rule <k> #setArgFromStack(IDX, operandCopy(place(local(I), .ProjectionElems)))
=>
#incrementRef({CALLERLOCALS[I]}:>TypedLocal) ~> #setLocalValue(place(local(IDX), .ProjectionElems))
#setLocalValue(place(local(IDX), .ProjectionElems), #incrementRef({CALLERLOCALS[I]}:>TypedLocal))
...
</k>
<stack> ListItem(StackFrame(_, _, _, _, CALLERLOCALS)) _:List </stack>
Expand All @@ -584,7 +596,7 @@ An operand may be a `Reference` (the only way a function could access another fu

rule <k> #setArgFromStack(IDX, operandMove(place(local(I), .ProjectionElems)))
=>
#incrementRef({CALLERLOCALS[I]}:>TypedLocal) ~> #setLocalValue(place(local(IDX), .ProjectionElems))
#setLocalValue(place(local(IDX), .ProjectionElems), #incrementRef({CALLERLOCALS[I]}:>TypedLocal))
...
</k>
<stack> ListItem(StackFrame(_, _, _, _, CALLERLOCALS => CALLERLOCALS[I <- Moved])) _:List
Expand All @@ -597,25 +609,39 @@ An operand may be a `Reference` (the only way a function could access another fu
```
The `Assert` terminator checks that an operand holding a boolean value (which has previously been computed, e.g., an overflow flag for arithmetic operations) has the expected value (e.g., that this overflow flag is `false` - a very common case).
If the condition value is as expected, the program proceeds with the given `target` block.
Otherwise the provided message is passed to a `panic!` call, ending the program with an error, modelled as an `#AssertError` in the semantics.
Otherwise the provided message is passed to a `panic!` call, ending the program with an error, modelled as an `AssertError` in the semantics.

```k
syntax KItem ::= #AssertError ( AssertMessage )
syntax MIRError ::= AssertError ( AssertMessage )

rule <k> #execTerminator(terminator(assert(COND, EXPECTED, MSG, TARGET, _UNWIND), _SPAN)) ~> _CONT
=>
#readOperand(COND) ~> #expect(EXPECTED, MSG) ~> #execBlockIdx(TARGET)
#expect(COND, EXPECTED, MSG) ~> #execBlockIdx(TARGET)
</k>

syntax KItem ::= #expect ( Bool, AssertMessage )
syntax KItem ::= #expect ( Evaluation, Bool, AssertMessage ) [strict(1)]

rule <k> typedLocal(BoolVal(COND), _, _) ~> #expect(EXPECTED, _MSG) => .K ... </k>
rule <k> #expect(typedValue(BoolVal(COND), _, _), EXPECTED, _MSG) => .K ... </k>
requires COND ==Bool EXPECTED

rule <k> typedLocal(BoolVal(COND), _, _) ~> #expect(EXPECTED, MSG) => #AssertError(MSG) ... </k>
rule <k> #expect(typedValue(BoolVal(COND), _, _), EXPECTED, MSG) => AssertError(MSG) ... </k>
requires COND =/=Bool EXPECTED
```

### Stopping on Program Errors

The semantics has a dedicated error sort to stop execution when flawed input or undefined behaviour is detected.
This includes cases of invalid MIR (e.g., accessing non-existing locals in a block or jumping to non-existing blocks), mutation of immutable values, or accessing uninitialised locals, but also user errors such as division by zero or overflowing unchecked arithmetic operations.

The execution will stop with the respective error information as soon as an error condition is detected.

```k
syntax KItem ::= #ProgramError ( MIRError )

rule [program-error]:
<k> ERR:MIRError => #ProgramError(ERR) ...</k>
```

```k
endmodule
```
Expand Down
Loading