Include more request configuration options into the span attributes for the Google GenAI SDK instrumentation #3374

Open
wants to merge 23 commits into
base: main
Commits
14ab0b4
Create a utility to simplify recording request attributes.
michaelsafyan Mar 6, 2025
cd5a36a
Merge branch 'open-telemetry:main' into google_genai_attribute_improv…
michaelsafyan Mar 14, 2025
2fc0ad1
Update recording mechanism to record more request options.
michaelsafyan Mar 14, 2025
969c003
Merge branch 'open-telemetry:main' into google_genai_attribute_improv…
michaelsafyan Mar 19, 2025
88b7e45
Improve the recording of span request attributes.
michaelsafyan Mar 19, 2025
744ef1c
Reformat with ruff.
michaelsafyan Mar 19, 2025
76c84c3
Update TODOs to reflect change made here.
michaelsafyan Mar 19, 2025
3953ea1
Update changelog now that PR has been created and can be referenced.
michaelsafyan Mar 19, 2025
d6f5f36
Merge branch 'open-telemetry:main' into google_genai_attribute_improv…
michaelsafyan Mar 25, 2025
7354e6f
Merge branch 'main' into google_genai_attribute_improvements
michaelsafyan Mar 26, 2025
d3526fa
Merge branch 'main' into google_genai_attribute_improvements
michaelsafyan Mar 27, 2025
80c8df1
Fix lint issues.
michaelsafyan Mar 27, 2025
43987e4
Reformat with ruff.
michaelsafyan Mar 27, 2025
6b8c599
Merge branch 'main' into google_genai_attribute_improvements
michaelsafyan Mar 28, 2025
aaaa017
Add more documentation comments requested in the pull request.
michaelsafyan Mar 28, 2025
3d911b6
Merge branch 'main' into google_genai_attribute_improvements
michaelsafyan Mar 28, 2025
5153080
Merge branch 'main' into google_genai_attribute_improvements
michaelsafyan Mar 31, 2025
031369b
Merge branch 'main' into google_genai_attribute_improvements
michaelsafyan Apr 2, 2025
fa8fa60
Add tests and comments that provide some additional clarity regarding…
michaelsafyan Apr 4, 2025
2a08ecd
Add tests and comments that provide some additional clarity regarding…
michaelsafyan Apr 4, 2025
97dab62
Handle corner case where flatten function returns compound output.
michaelsafyan Apr 4, 2025
3107e56
Update prefix to match currently proposed SemConv.
michaelsafyan Apr 4, 2025
cb4ca3b
Update to specify attributes from SemConv constants per PR feedback.
michaelsafyan Apr 4, 2025
@@ -7,6 +7,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0

## Unreleased

- Add more request configuration options to the span attributes ([#3374](https://github.com/open-telemetry/opentelemetry-python-contrib/pull/3374))
- Restructure tests to keep in line with repository conventions ([#3344](https://github.com/open-telemetry/opentelemetry-python-contrib/pull/3344))

## Version 0.1b0 (2025-03-05)
@@ -4,7 +4,6 @@

Here are some TODO items required to achieve stability for this package:

- Add more span-level attributes for request configuration
- Add more span-level attributes for response information
- Verify and correct formatting of events:
- Including the 'role' field for message events
@@ -37,9 +37,9 @@ classifiers = [
"Programming Language :: Python :: 3.12"
]
dependencies = [
-  "opentelemetry-api >=1.30.0, <2",
-  "opentelemetry-instrumentation >=0.51b0, <2",
-  "opentelemetry-semantic-conventions >=0.51b0, <2"
+  "opentelemetry-api >=1.31.1, <2",
+  "opentelemetry-instrumentation >=0.52b1, <2",
+  "opentelemetry-semantic-conventions >=0.52b1, <2"
]

[project.optional-dependencies]
@@ -0,0 +1,18 @@
# Copyright The OpenTelemetry Authors
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.


# Prefix to use for LLM model request attributes that are unique to GCP
# (or that have not yet been formally defined in the GenAI/LLM SIG).
CUSTOM_LLM_REQUEST_PREFIX = "gcp.gen_ai.request"
@@ -0,0 +1,235 @@
# Copyright The OpenTelemetry Authors
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.


import json
from typing import Any, Callable, Dict, Optional, Sequence, Set, Union

Primitive = Union[bool, str, int, float]
BoolList = list[bool]
StringList = list[str]
IntList = list[int]
FloatList = list[float]
HomogenousPrimitiveList = Union[BoolList, StringList, IntList, FloatList]
FlattenedValue = Union[Primitive, HomogenousPrimitiveList]
FlattenedDict = Dict[str, FlattenedValue]


def _concat_key(prefix: Optional[str], suffix: str):
    if not prefix:
        return suffix
    return f"{prefix}.{suffix}"


def _is_primitive(v):
    for t in [str, bool, int, float]:
        if isinstance(v, t):
            return True
    return False


def _is_homogenous_primitive_list(v):
    if not isinstance(v, list):
        return False
    if len(v) == 0:
        return True
    if not _is_primitive(v[0]):
        return False
    first_entry_value_type = type(v[0])
    for entry in v[1:]:
        if not isinstance(entry, first_entry_value_type):
            return False
    return True


def _get_flatten_func(
    flatten_functions: Dict[str, Callable], key_names: set[str]
):
    for key in key_names:
        flatten_func = flatten_functions.get(key)
        if flatten_func is not None:
            return flatten_func
    return None


def _flatten_compound_value(
    key: str,
    value: Any,
    exclude_keys: Set[str],
    rename_keys: Dict[str, str],
    flatten_functions: Dict[str, Callable],
    key_names: Set[str],
    _from_json=False,
) -> FlattenedDict:
    flatten_func = _get_flatten_func(flatten_functions, key_names)
    if flatten_func is not None:
        func_output = flatten_func(
            key,
            value,
            exclude_keys=exclude_keys,
            rename_keys=rename_keys,
            flatten_functions=flatten_functions,
        )
        if func_output is None:
            return {}
        elif _is_primitive(func_output) or _is_homogenous_primitive_list(func_output):
            return {key: func_output}
        else:
            value = func_output
    if isinstance(value, dict):
        return _flatten_dict(
            value,
            key_prefix=key,
            exclude_keys=exclude_keys,
            rename_keys=rename_keys,
            flatten_functions=flatten_functions,
        )
    if isinstance(value, list):
        if _is_homogenous_primitive_list(value):
            return {key: value}
        return _flatten_list(
            value,
            key_prefix=key,
            exclude_keys=exclude_keys,
            rename_keys=rename_keys,
            flatten_functions=flatten_functions,
        )
    if hasattr(value, "model_dump"):
        return _flatten_dict(
            value.model_dump(),
            key_prefix=key,
            exclude_keys=exclude_keys,
            rename_keys=rename_keys,
            flatten_functions=flatten_functions,
        )
    if _from_json:
        raise ValueError(
            f"Cannot flatten value with key {key}; value: {value}"
        )
Comment on lines +116 to +119
Member

It looks like this causes the whole call to flatten_dict() to fail and no attributes would get added to the span

Contributor Author

Yes. The flatten_dict either succeeds or it does not. I think it is better to fail fully, loudly with a clear error than to partially succeed, making it difficult to debug why certain information is absent.
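
For readers following this thread, the all-or-nothing behaviour of the JSON fallback can be reproduced in isolation. The snippet below is a minimal sketch, not the code in this PR: `json_round_trip` and `Opaque` are hypothetical names introduced only for illustration, and the error message is copied from the diff.

```python
import json
from typing import Any


def json_round_trip(key: str, value: Any) -> Any:
    """Hypothetical helper mirroring the fallback path: convert the value to
    JSON-compatible data, or raise ValueError if that is not possible."""
    try:
        json_string = json.dumps(value)
    except TypeError as exc:
        raise ValueError(
            f"Cannot flatten value with key {key}; value: {value}. Not JSON serializable."
        ) from exc
    return json.loads(json_string)


class Opaque:
    """Stand-in for an object that json.dumps() cannot serialize."""


# A tuple survives the round trip and comes back as a list.
print(json_round_trip("config.stop", ("a", "b")))

# An unserializable object aborts the whole operation with a clear error.
try:
    json_round_trip("config.custom", Opaque())
except ValueError as exc:
    print(f"flattening aborted: {exc}")
```

Because the exception propagates out of the recursive calls, a caller that wants partial results would have to catch `ValueError` around the top-level call and decide what to record instead.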

    try:
        json_string = json.dumps(value)
    except TypeError as exc:
        raise ValueError(
            f"Cannot flatten value with key {key}; value: {value}. Not JSON serializable."
        ) from exc
    json_value = json.loads(json_string)
    return _flatten_value(
        key,
        json_value,
        exclude_keys=exclude_keys,
        rename_keys=rename_keys,
        flatten_functions=flatten_functions,
        # Ensure that we don't recurse indefinitely if "json.loads()" somehow returns
        # a complex, compound object that does not get handled by the "primitive", "list",
        # or "dict" cases. Prevents falling back on the JSON serialization fallback path.
        _from_json=True,
    )


def _flatten_value(
    key: str,
    value: Any,
    exclude_keys: Set[str],
    rename_keys: Dict[str, str],
    flatten_functions: Dict[str, Callable],
    _from_json=False,
) -> FlattenedDict:
    if value is None:
        return {}
    key_names = set([key])
    renamed_key = rename_keys.get(key)
    if renamed_key is not None:
        key_names.add(renamed_key)
        key = renamed_key
    if key_names & exclude_keys:
        return {}
    if _is_primitive(value):
        return {key: value}
    return _flatten_compound_value(
        key=key,
        value=value,
        exclude_keys=exclude_keys,
        rename_keys=rename_keys,
        flatten_functions=flatten_functions,
        key_names=key_names,
        _from_json=_from_json,
    )


def _flatten_dict(
    d: Dict[str, Any],
    key_prefix: str,
    exclude_keys: Set[str],
    rename_keys: Dict[str, str],
    flatten_functions: Dict[str, Callable],
) -> FlattenedDict:
    result = {}
    for key, value in d.items():
        if key in exclude_keys:
            continue
        full_key = _concat_key(key_prefix, key)
        flattened = _flatten_value(
            full_key,
            value,
            exclude_keys=exclude_keys,
            rename_keys=rename_keys,
            flatten_functions=flatten_functions,
        )
        result.update(flattened)
    return result


def _flatten_list(
    lst: list[Any],
    key_prefix: str,
    exclude_keys: Set[str],
    rename_keys: Dict[str, str],
    flatten_functions: Dict[str, Callable],
) -> FlattenedDict:
    result = {}
    result[_concat_key(key_prefix, "length")] = len(lst)
    for index, value in enumerate(lst):
        full_key = f"{key_prefix}[{index}]"
Comment on lines +201 to +203
Member

What are the foo.length and foo[i] for? I don't see those in the test cases anywhere

Contributor Author

Added tests to cover this case. This applies where there are lists of either complex objects or of heterogenous primitives. Only in the case of homogenous primitive lists are the lists encoded as plain lists.

I think this should now be much clearer with those tests in place.
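
To make the two list encodings described in this reply concrete, here is a simplified, self-contained sketch. It is an assumption-laden reduction of the diff (no `exclude_keys`, `rename_keys`, or `flatten_functions` support), and `flatten_sketch` is a hypothetical name, not a function in this PR.

```python
from typing import Any, Dict


def _is_primitive(v: Any) -> bool:
    return isinstance(v, (str, bool, int, float))


def _is_homogeneous_primitive_list(v: Any) -> bool:
    if not isinstance(v, list):
        return False
    if not v:
        return True
    if not _is_primitive(v[0]):
        return False
    first_type = type(v[0])
    return all(isinstance(entry, first_type) for entry in v[1:])


def flatten_sketch(key: str, value: Any) -> Dict[str, Any]:
    """Hypothetical, reduced version of the flattening logic in the diff."""
    if value is None:
        return {}
    if _is_primitive(value) or _is_homogeneous_primitive_list(value):
        # Homogeneous primitive lists are kept as plain lists.
        return {key: value}
    if isinstance(value, dict):
        result: Dict[str, Any] = {}
        for sub_key, sub_value in value.items():
            result.update(flatten_sketch(f"{key}.{sub_key}", sub_value))
        return result
    if isinstance(value, list):
        # Mixed or compound lists are expanded into "<key>.length"
        # plus one "<key>[i]" entry per element.
        result = {f"{key}.length": len(value)}
        for index, item in enumerate(value):
            result.update(flatten_sketch(f"{key}[{index}]", item))
        return result
    raise ValueError(f"Cannot flatten value with key {key}; value: {value}")


print(flatten_sketch("foo", ["a", "b"]))
# {'foo': ['a', 'b']}
print(flatten_sketch("foo", ["a", 1]))
# {'foo.length': 2, 'foo[0]': 'a', 'foo[1]': 1}
print(flatten_sketch("foo", [{"x": 1}]))
# {'foo.length': 1, 'foo[0].x': 1}
```

The indexed form exists because span attribute values are limited to primitives and homogeneous arrays of primitives, so a mixed or nested list has to be spread across several keys.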

        flattened = _flatten_value(
            full_key,
            value,
            exclude_keys=exclude_keys,
            rename_keys=rename_keys,
            flatten_functions=flatten_functions,
        )
        result.update(flattened)
    return result


def flatten_dict(
    d: Dict[str, Any],
    key_prefix: Optional[str] = None,
    exclude_keys: Optional[Sequence[str]] = None,
    rename_keys: Optional[Dict[str, str]] = None,
    flatten_functions: Optional[Dict[str, Callable]] = None,
Member

Can you add a signature to the callable?

Contributor Author

I'm not sure how to express it in type annotations...

The signature is:

def flatten_func(
        key: str,
        value: any,
        exclude_keys=Set[str],
        rename_keys=Dict[str, str],
        flatten_functions=Dict[str, flatten_func],
        **kwargs) -> Union[FlattenedDict, FlattenedValue]
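
One possible way to express this signature in type annotations is a callback protocol from the standard `typing` module. The following is a sketch under assumptions, not something this PR adopts: `FlattenFunc` and `join_as_csv` are hypothetical names, the type aliases are re-declared so the snippet runs on its own, and `None` is included in the return type only because the diff handles a `None` result from the flatten function.

```python
from typing import Any, Dict, List, Protocol, Set, Union

Primitive = Union[bool, str, int, float]
FlattenedValue = Union[Primitive, List[bool], List[str], List[int], List[float]]
FlattenedDict = Dict[str, FlattenedValue]


class FlattenFunc(Protocol):
    """Hypothetical callback protocol for entries of flatten_functions."""

    def __call__(
        self,
        key: str,
        value: Any,
        exclude_keys: Set[str],
        rename_keys: Dict[str, str],
        flatten_functions: Dict[str, "FlattenFunc"],
        **kwargs: Any,
    ) -> Union[FlattenedDict, FlattenedValue, None]: ...


def join_as_csv(
    key: str,
    value: Any,
    exclude_keys: Set[str],
    rename_keys: Dict[str, str],
    flatten_functions: Dict[str, FlattenFunc],
    **kwargs: Any,
) -> str:
    """Example flatten function: encode a list as one comma-separated string."""
    return ",".join(str(item) for item in value)


# A plain function with a matching signature satisfies the protocol structurally.
handlers: Dict[str, FlattenFunc] = {"config.stop_sequences": join_as_csv}

print(
    handlers["config.stop_sequences"](
        "config.stop_sequences",
        ["a", "b"],
        exclude_keys=set(),
        rename_keys={},
        flatten_functions=handlers,
    )
)
# a,b
```

With this in place, the parameter could be annotated as `Optional[Dict[str, FlattenFunc]]` instead of `Optional[Dict[str, Callable]]`, and a type checker would flag handlers whose parameters do not match.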

):
    key_prefix = key_prefix or ""
    if exclude_keys is None:
        exclude_keys = set()
    elif isinstance(exclude_keys, list):
        exclude_keys = set(exclude_keys)
Comment on lines +223 to +226
Member

Suggested change
-    if exclude_keys is None:
-        exclude_keys = set()
-    elif isinstance(exclude_keys, list):
-        exclude_keys = set(exclude_keys)
+    exclude_keys = set(exclude_keys or [])
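
The practical difference between the two forms shows up when a caller passes a sequence that is not a `list`, which the `Optional[Sequence[str]]` annotation allows. The sketch below isolates both normalisation strategies; `normalize_original` and `normalize_suggested` are hypothetical names used only for this comparison.

```python
from typing import Optional, Sequence, Set


def normalize_original(exclude_keys):
    """Mirrors the branch logic in the diff: only None and list are converted."""
    if exclude_keys is None:
        exclude_keys = set()
    elif isinstance(exclude_keys, list):
        exclude_keys = set(exclude_keys)
    return exclude_keys


def normalize_suggested(exclude_keys: Optional[Sequence[str]]) -> Set[str]:
    """Mirrors the suggested one-liner: any sequence (or None) becomes a set."""
    return set(exclude_keys or [])


as_tuple = ("config.system_instruction",)

# The original form leaves a tuple untouched ...
print(type(normalize_original(as_tuple)).__name__)   # tuple
# ... while the suggested form always yields a set.
print(type(normalize_suggested(as_tuple)).__name__)  # set

# A later set intersection such as `key_names & exclude_keys` needs a real set:
key_names = {"config.system_instruction"}
try:
    key_names & normalize_original(as_tuple)
except TypeError as exc:
    print(f"TypeError: {exc}")
print(key_names & normalize_suggested(as_tuple))
```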

    rename_keys = rename_keys or {}
    flatten_functions = flatten_functions or {}
    return _flatten_dict(
        d,
        key_prefix=key_prefix,
        exclude_keys=exclude_keys,
        rename_keys=rename_keys,
        flatten_functions=flatten_functions,
    )