generated from amazon-archives/__template_Apache-2.0
    
        
        - 
                Notifications
    You must be signed in to change notification settings 
- Fork 621
Add documentation for unified OTLP source in Data prepper #11033
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
          
     Merged
      
        
      
    
  
     Merged
                    Changes from 8 commits
      Commits
    
    
            Show all changes
          
          
            11 commits
          
        
        Select commit
          Hold shift + click to select a range
      
      0a790ff
              
                add documentation for unified OTLP source
              
              
                ps48 238d4ed
              
                update getEventType function documentation
              
              
                ps48 92e6c18
              
                Adding style updates
              
              
                ps48 756ea31
              
                update when condition
              
              
                ps48 a535376
              
                resolve comments
              
              
                ps48 ecdafdc
              
                Doc review
              
              
                kolchfa-aws e3b777e
              
                update migration section with example
              
              
                ps48 213aca2
              
                Apply suggestions from code review
              
              
                kolchfa-aws 1018e02
              
                Update _data-prepper/pipelines/configuration/sources/otlp-source.md
              
              
                kolchfa-aws 87cfc93
              
                Update _data-prepper/pipelines/configuration/sources/otlp-source.md
              
              
                kolchfa-aws 2f57dc0
              
                Apply suggestions from code review
              
              
                natebower File filter
Filter by extension
Conversations
          Failed to load comments.   
        
        
          
      Loading
        
  Jump to
        
          Jump to file
        
      
      
          Failed to load files.   
        
        
          
      Loading
        
  Diff view
Diff view
There are no files selected for viewing
        
          
  
    
      
          
            335 changes: 335 additions & 0 deletions
          
          335 
        
  _data-prepper/pipelines/configuration/sources/otlp-source.md
  
  
      
      
   
        
      
      
    
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              | Original file line number | Diff line number | Diff line change | 
|---|---|---|
| @@ -0,0 +1,335 @@ | ||
| --- | ||
| layout: default | ||
| title: OTLP source | ||
| parent: Sources | ||
| grand_parent: Pipelines | ||
| nav_order: 85 | ||
| --- | ||
|  | ||
| # OTLP source | ||
|  | ||
| The `otlp` source is a unified OpenTelemetry source that follows the [OpenTelemetry Protocol (OTLP) specification](https://opentelemetry.io/docs/specs/otlp/) and can receive logs, metrics, and traces through a single endpoint. This source consolidates the functionality of the individual `otel_logs_source`, `otel_metrics_source`, and `otel_trace_source` sources, providing a streamlined approach to ingesting all OpenTelemetry telemetry signals. | ||
|  | ||
| The OTLP source supports both the `OTLP/gRPC` and `OTLP/HTTP` protocols. For `OTLP/HTTP`, only Protobuf encoding is supported. This makes it compatible with a wide range of OpenTelemetry collectors and instrumentation libraries. | ||
| {: .note} | ||
|  | ||
| ## Configuration | ||
|  | ||
| You can configure the `otlp` source with the following options. | ||
|  | ||
| | Option | Type | Description | | ||
| | :--- | :--- | :--- | | ||
| | `port` | Integer | The port on which the OTLP source listens. Default is `21893`. | | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
| | `logs_path` | String | The path for sending unframed HTTP requests for logs. Must start with `/` and have a minimum length of 1. Default is `/opentelemetry.proto.collector.logs.v1.LogsService/Export`. | | ||
| | `metrics_path` | String | The path for sending unframed HTTP requests for metrics. Must start with `/` and have a minimum length of 1. Default is `/opentelemetry.proto.collector.metrics.v1.MetricsService/Export`. | | ||
| | `traces_path` | String | The path for sending unframed HTTP requests for traces. Must start with `/` and have a minimum length of 1. Default is `/opentelemetry.proto.collector.trace.v1.TraceService/Export`. | | ||
| | `request_timeout` | Duration | The request timeout duration. Default is `10s`. | | ||
| | `retry_info` | Object | Configures retry behavior. Supports `min_delay` (default `100ms`) and `max_delay` (default `2s`) parameters to control exponential backoff. See [Retry information](#retry-information).| | ||
| | `health_check_service` | Boolean | Enables a gRPC health check service under `grpc.health.v1.Health/Check`. When `unframed_requests` is `true`, enables HTTP health check at `/health`. Default is `false`. | | ||
| | `proto_reflection_service` | Boolean | Enables a reflection service for Protobuf services (see [ProtoReflectionService](https://grpc.github.io/grpc-java/javadoc/io/grpc/protobuf/services/ProtoReflectionService.html) and [gRPC reflection](https://github.com/grpc/grpc-java/blob/master/documentation/server-reflection-tutorial.md)). Default is `false`. | | ||
| | `unframed_requests` | Boolean | Enables requests not framed using the gRPC wire protocol. Default is `false`. | | ||
| | `thread_count` | Integer | The number of threads to keep in the scheduled thread pool. Default is `200`. | | ||
| | `max_connection_count` | Integer | The maximum allowed number of open connections. Default is `500`. | | ||
| | `max_request_length` | String | The maximum number of bytes allowed in the payload of a single gRPC or HTTP request. Default is `10mb`. | | ||
| | `compression` | String | The compression type applied on the client request payload. Valid values are `none` (no compression) or `gzip` (apply `gzip` decompression). Default is `none`. | | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
| | `output_format` | String | Specifies the decoded output format for all signals (logs, metrics, and traces) if individual output format options are not set. Valid values are `otel` (OpenTelemetry format) and `opensearch` (OpenSearch format). Default is `otel`. | | ||
| | `logs_output_format` | String | Specifies the decoded output format specifically for logs. Takes precedence over `output_format` for logs. Valid values are `otel` and `opensearch`. Default is `otel`. | | ||
| | `metrics_output_format` | String | Specifies the decoded output format specifically for metrics. Takes precedence over `output_format` for metrics. Valid values are `otel` and `opensearch`. Default is `otel`. | | ||
| | `traces_output_format` | String | Specifies the decoded output format specifically for traces. Takes precedence over `output_format` for traces. Valid values are `otel` and `opensearch`. Default is `otel`. | | ||
|  | ||
| If an individual output format (for example, `logs_output_format`) is set, it takes precedence over the generic `output_format` for that signal type. If neither is set, the default is `otel`. | ||
| {: .note} | ||
|  | ||
| ### SSL/TLS configuration | ||
|  | ||
| You can configure SSL/TLS in the `otlp` source with the following options. | ||
|  | ||
| | Option | Type | Description | | ||
| | :--- | :--- | :--- | | ||
| | `ssl` | Boolean | Enables TLS/SSL. Default is `true`. | | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
| | `ssl_certificate_file` | String | The SSL certificate chain file path or Amazon S3 path (for example, `s3://<bucketName>/<path>`). Required if `ssl` is set to `true`. | | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
| | `ssl_key_file` | String | The SSL key file path or Amazon S3 path (for example, `s3://<bucketName>/<path>`). Required if `ssl` is set to `true`. | | ||
| | `use_acm_cert_for_ssl` | Boolean | Enables TLS/SSL using a certificate and private key from AWS Certificate Manager (ACM). Default is `false`. | | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
| | `acm_certificate_arn` | String | The ACM certificate Amazon Resource Name (ARN). ACM certificates take precedence over Amazon S3 or local file system certificates. Required if `use_acm_cert_for_ssl` is set to `true`. | | ||
| | `acm_private_key_password` | String | The ACM private key password that decrypts the private key. If not provided, Data Prepper uses the private key unencrypted. | | ||
| Check failure on line 54 in _data-prepper/pipelines/configuration/sources/otlp-source.md 
     | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
| | `aws_region` | String | The AWS Region used by ACM or Amazon S3. Required if `use_acm_cert_for_ssl` is set to `true` or if `ssl_certificate_file` and `ssl_key_file` are Amazon S3 paths. | | ||
|  | ||
| ### Authentication configuration | ||
|  | ||
| By default, the OTLP source runs without authentication. You can configure authentication using the following options. | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
|  | ||
| To explicitly disable authentication, specify the following settings: | ||
|  | ||
| ```yaml | ||
| source: | ||
| otlp: | ||
| authentication: | ||
| unauthenticated: | ||
| ``` | ||
| {% include copy.html %} | ||
|  | ||
| To enable HTTP Basic authentication, specify the following settings: | ||
|  | ||
| ```yaml | ||
| source: | ||
| otlp: | ||
| authentication: | ||
| http_basic: | ||
| username: my-user | ||
| password: my_s3cr3t | ||
| ``` | ||
| {% include copy.html %} | ||
|  | ||
| This plugin uses pluggable authentication for gRPC servers. To provide custom authentication, create a plugin that implements [`GrpcAuthenticationProvider`](https://github.com/opensearch-project/data-prepper/blob/main/data-prepper-plugins/armeria-common/src/main/java/org/opensearch/dataprepper/armeria/authentication/GrpcAuthenticationProvider.java). | ||
|  | ||
| ### Retry information | ||
|  | ||
| You can set retry behavior using the `retry_info` setting, specifying how long to wait for the next request when backpressure occurs. The retry mechanism applies exponential backoff with a configurable maximum delay: | ||
|  | ||
| ```yaml | ||
| source: | ||
| otlp: | ||
| retry_info: | ||
| min_delay: 100ms # defaults to 100ms | ||
| max_delay: 2s # defaults to 2s | ||
| ``` | ||
| {% include copy.html %} | ||
|  | ||
| ## Usage | ||
|  | ||
| The following examples demonstrate how to configure and use the OTLP source in various scenarios. | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
|  | ||
| ### Basic configuration | ||
|  | ||
| To get started with the `otlp` source, create a `pipeline.yaml` file with the following minimal configuration: | ||
|  | ||
| ```yaml | ||
| pipeline: | ||
| source: | ||
| otlp: | ||
| ssl: false | ||
| sink: | ||
| - stdout: | ||
| ``` | ||
| {% include copy.html %} | ||
|  | ||
| ### Routing telemetry signals | ||
|  | ||
| One of the key features of the OTLP source is the ability to route different telemetry signals (logs, metrics, and traces) to different processors or sinks based on your specific needs. Routing is determined by metadata using the getEventType() function: | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
|  | ||
| ```yaml | ||
| version: "2" | ||
| otel-telemetry: | ||
| source: | ||
| otlp: | ||
| ssl: false | ||
| route: | ||
| - traces: 'getEventType() == "TRACE"' | ||
| - logs: 'getEventType() == "LOG"' | ||
| - metrics: 'getEventType() == "METRIC"' | ||
| sink: | ||
| - opensearch: | ||
| routes: | ||
| - logs | ||
| hosts: [ "https://opensearch:9200" ] | ||
| index: logs-%{yyyy.MM.dd} | ||
| username: admin | ||
| password: yourStrongPassword123! | ||
| insecure: true | ||
| - pipeline: | ||
| name: traces-raw | ||
| routes: | ||
| - traces | ||
| - pipeline: | ||
| name: otel-metrics | ||
| routes: | ||
| - metrics | ||
|  | ||
| traces-raw: | ||
| source: | ||
| pipeline: | ||
| name: otel-telemetry | ||
| processor: | ||
| - otel_trace_raw: | ||
| sink: | ||
| - opensearch: | ||
| hosts: [ "https://opensearch:9200" ] | ||
| index_type: trace-analytics-raw | ||
| username: admin | ||
| password: yourStrongPassword123! | ||
| insecure: true | ||
|  | ||
| otel-metrics: | ||
| source: | ||
| pipeline: | ||
| name: otel-telemetry | ||
| processor: | ||
| - otel_metrics: | ||
| calculate_histogram_buckets: true | ||
| calculate_exponential_histogram_buckets: true | ||
| exponential_histogram_max_allowed_scale: 10 | ||
| flatten_attributes: false | ||
| sink: | ||
| - opensearch: | ||
| hosts: [ "https://opensearch:9200" ] | ||
| index: metrics-otel-%{yyyy.MM.dd} | ||
| username: admin | ||
| password: yourStrongPassword123! | ||
| insecure: true | ||
| ``` | ||
| {% include copy.html %} | ||
|  | ||
| ### Using OpenSearch output format | ||
|  | ||
| To generate data in the OpenSearch format for all telemetry signals, specify the following settings: | ||
|  | ||
| ```yaml | ||
| source: | ||
| otlp: | ||
| output_format: opensearch | ||
| ``` | ||
| {% include copy.html %} | ||
|  | ||
| To use different output formats for different signal types, specify the following settings: | ||
|  | ||
| ```yaml | ||
| source: | ||
| otlp: | ||
| logs_output_format: opensearch | ||
| metrics_output_format: otel | ||
| traces_output_format: opensearch | ||
| ``` | ||
| {% include copy.html %} | ||
|  | ||
| ### Configuring with SSL/TLS | ||
|  | ||
| To enable SSL/TLS with local certificates, specify the following settings: | ||
|  | ||
| ```yaml | ||
| source: | ||
| otlp: | ||
| ssl: true | ||
| ssl_certificate_file: "/path/to/certificate.crt" | ||
| ssl_key_file: "/path/to/private-key.key" | ||
| ``` | ||
| {% include copy.html %} | ||
|  | ||
| To use the AWS Certificate Manager, specify the following settings: | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
|  | ||
| ```yaml | ||
| source: | ||
| otlp: | ||
| ssl: true | ||
| use_acm_cert_for_ssl: true | ||
| acm_certificate_arn: "arn:aws:acm:us-east-1:123456789012:certificate/12345678-1234-1234-1234-123456789012" | ||
| aws_region: "us-east-1" | ||
| ``` | ||
| {% include copy.html %} | ||
|  | ||
| ## Metrics | ||
|  | ||
| The `otlp` source includes the following metrics to monitor its performance and health. | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
|  | ||
| ### Counters | ||
|  | ||
| The following counters track request activity and errors in the OTLP source. | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
|  | ||
| | Metric | Description | | ||
| | :--- | :--- | | ||
| | `requestTimeouts` | The total number of requests that timed out. | | ||
| | `requestsReceived` | The total number of requests received by the OTLP source. | | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
| | `successRequests` | The total number of requests successfully processed by the OTLP source. | | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
| | `badRequests` | The total number of requests with invalid format processed by the OTLP source. | | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
| | `requestsTooLarge` | The total number of requests that exceed the maximum allowed size. | | ||
| | `internalServerError` | The total number of requests processed by the OTLP source with custom exception types. | | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
|  | ||
| ### Timers | ||
|  | ||
| The following timers track request activity and errors in the OTLP source. | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
|  | ||
| | Metric | Description | | ||
| | :--- | :--- | | ||
| | `requestProcessDuration` | The latency of requests processed by the OTLP source, in seconds. | | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
|  | ||
| ### Distribution summaries | ||
|  | ||
| The following distribution summaries track request activity and errors in the OTLP source. | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
|  | ||
| | Metric | Description | | ||
| | :--- | :--- | | ||
| | `payloadSize` | The distribution of incoming request payload sizes, in bytes. | | ||
|  | ||
| ## Migrating from individual OTel sources | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
|  | ||
| If you're using separate `otel_logs_source`, `otel_metrics_source`, or `otel_trace_source` sources, you can migrate to the unified OTLP source by following these steps: | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
|  | ||
| 1. Replace all three sources with a single `otlp` source. | ||
| 2. Use [routing configuration](#routing-telemetry-signals) to direct different signal types to their appropriate pipelines. | ||
| 3. Change the port numbers if needed (the OTLP source uses port `21893` by default). | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
|  | ||
| ### Migration example | ||
|  | ||
| The following example demonstrates how to consolidate separate OTel logs, metrics, and traces sources into a single OTLP source with routing. | ||
|         
                  kolchfa-aws marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
|  | ||
| Consider a setup where logs, metrics, and traces are configured separately: | ||
|  | ||
| ```yaml | ||
| logs-pipeline: | ||
| source: | ||
| otel_logs_source: | ||
| port: 21892 | ||
| sink: | ||
| - opensearch: | ||
| index: logs | ||
| ``` | ||
| ```yaml | ||
| metrics-pipeline: | ||
| source: | ||
| otel_metrics_source: | ||
| port: 21891 | ||
| sink: | ||
| - opensearch: | ||
| index: metrics | ||
| ``` | ||
| ```yaml | ||
| traces-pipeline: | ||
| source: | ||
| otel_trace_source: | ||
| port: 21890 | ||
| sink: | ||
| - opensearch: | ||
| index: traces | ||
| ``` | ||
|  | ||
| You can consolidate logs, metrics, and traces into a single OTLP source as follows: | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
|  | ||
| ```yaml | ||
| otlp-pipeline: | ||
| source: | ||
| otlp: | ||
| port: 21893 | ||
| route: | ||
| - logs: 'getEventType() == "LOG"' | ||
| - metrics: 'getEventType() == "METRIC"' | ||
| - traces: 'getEventType() == "TRACE"' | ||
| sink: | ||
| - opensearch: | ||
| routes: | ||
| - logs | ||
| index: logs | ||
| - opensearch: | ||
| routes: | ||
| - metrics | ||
| index: metrics | ||
| - opensearch: | ||
| routes: | ||
| - traces | ||
| index: traces | ||
| ``` | ||
|         
                  kolchfa-aws marked this conversation as resolved.
              Show resolved
            Hide resolved | ||
|  | ||
| ## Related articles | ||
|         
                  natebower marked this conversation as resolved.
              Outdated
          
            Show resolved
            Hide resolved | ||
|  | ||
| - [OTel logs source]({{site.url}}{{site.baseurl}}/data-prepper/pipelines/configuration/sources/otel-logs-source/) | ||
| - [OTel metrics source]({{site.url}}{{site.baseurl}}/data-prepper/pipelines/configuration/sources/otel-metrics-source/) | ||
| - [OTel trace source]({{site.url}}{{site.baseurl}}/data-prepper/pipelines/configuration/sources/otel-trace-source/) | ||
| - [getEventType()]({{site.url}}{{site.baseurl}}/data-prepper/pipelines/get-eventtype/) | ||
      
      Oops, something went wrong.
        
    
  
      
      Oops, something went wrong.
        
    
  
  Add this suggestion to a batch that can be applied as a single commit.
  This suggestion is invalid because no changes were made to the code.
  Suggestions cannot be applied while the pull request is closed.
  Suggestions cannot be applied while viewing a subset of changes.
  Only one suggestion per line can be applied in a batch.
  Add this suggestion to a batch that can be applied as a single commit.
  Applying suggestions on deleted lines is not supported.
  You must change the existing code in this line in order to create a valid suggestion.
  Outdated suggestions cannot be applied.
  This suggestion has been applied or marked resolved.
  Suggestions cannot be applied from pending reviews.
  Suggestions cannot be applied on multi-line comments.
  Suggestions cannot be applied while the pull request is queued to merge.
  Suggestion cannot be applied right now. Please check back later.
  
    
  
    
Uh oh!
There was an error while loading. Please reload this page.