LAPP - Log Auto Pattern Pipeline

LAPP turns raw log files into a structured investigation workspace for humans and AI agents.

It clusters repeated log lines with Drain, asks an LLM to assign semantic IDs to repeated patterns, then writes a file-based workspace with pattern notes, samples, error summaries, and agent instructions.

Quick Start

# Build
make build

# Clean, rebuild, and start the local web app
make dev

# Create a workspace
go run ./cmd/lapp/ workspace create app-incident

# Start the local web app
go run ./cmd/lapp/ web

# Add logs to the workspace and run discovery
go run ./cmd/lapp/ workspace add-log --topic app-incident /var/log/syslog

# AI-powered analysis (agent backend via ACP provider)
go run ./cmd/lapp/ workspace analyze --topic app-incident "why are there connection timeouts?" --acp claude
go run ./cmd/lapp/ workspace analyze --topic app-incident "what failed?" --acp codex

How It Works

The current CLI product is a structured file workspace under ~/.lapp/workspaces/<topic>/.

workspace add-log
  -> copy input into logs/
  -> read all files in logs/
  -> merge multiline entries
  -> discover repeated Drain patterns
  -> label repeated patterns with retried LLM batches
  -> write discovery-runs/<run-id>/patterns/, notes/, and AGENTS.md

Core idea: Drain clusters logs into templates cheaply (no API cost), then LLM semantifies the templates in bounded batches. This follows the IBM "Label Broadcasting" pattern — cluster first (90%+ volume reduction), apply LLM to representatives, broadcast labels back.

Generated workspace layout:

~/.lapp/workspaces/<topic>/
  logs/                         # copied source logs
  discovery-runs/<run-id>/
    run.json                    # status, progress, summary counts
    patterns/<semantic-id>/
      pattern.md                # metadata, counts, source line refs
      samples.log               # representative matching lines
    patterns/unmatched/samples.log
    notes/summary.md            # frequency-sorted overview
    notes/errors.md             # error/warning-focused view
    AGENTS.md                   # instructions for AI-assisted investigation
  AGENTS.md                     # initial workspace note before discovery

DiscoveryRuns are one-time local tasks. If lapp web starts and finds a previous QUEUED or RUNNING run on disk, it marks that run as failed because no worker is still attached to it.

Environment Variables

OPENROUTER_API_KEY: Required for semantic labeling in workspace add-log
MODEL_NAME: Override default LLM model (default: google/gemini-3-flash-preview)
Provider-specific auth for ACP agent CLI (for example Claude/Codex/Gemini CLI login credentials)
.env file is auto-loaded

Commands

Command	Description
`workspace create <topic>`	Create a workspace under `~/.lapp/workspaces/`
`workspace list`	List all workspace topics
`workspace add-log --topic <topic> <file>`	Add log file and run discovery
`workspace analyze --topic <topic> [question]`	Run AI analysis (`--acp claude

Event Schema

The initial normalized event contract is defined in proto/lapp/event/v1/event.proto and documented in docs/event-schema-v1.md. Representative fixtures live under fixtures/events/v1/ for JSON, logfmt, key=value, and plain text logs.

The event parser and DuckDB store packages are library-level building blocks. They are covered by tests and integration tests, but workspace add-log currently writes DiscoveryRun results directly into the local workspace instead of persisting through DuckDB first.

Development

make build                           # Build embedded web assets and output/lapp
make clean                           # Remove generated build artifacts
make dev                             # Clean, build, and start lapp web on 127.0.0.1:8080
make proto-gen                       # Generate protobuf/Connect code
make test                            # Run unit and integration tests
make check                           # Run formatting, linting, type checks, build, and unit tests

LOGHUB_PATH=/path/to/2k_dataset \
  go test -v ./integration_test/...  # Integration tests (14 Loghub-2.0 datasets)

Override the local web address with WEB_ADDR=127.0.0.1:3000 make dev.

Roadmap

See Issue #2 for full vision and progress.

Current

Structured workspace generation, multiline merging, Drain pattern discovery, LLM semantic labeling, ACP-backed agent analysis, event parsing, DuckDB storage primitives, and Loghub integration tests.

Next: Log Viewer

Color-coded log viewer with template filtering. Each semantic template gets a distinct color. Unmatched/leftover logs shown in gray.

Future

Persist discovery inputs/results through the event store
Iterative refinement (re-discover patterns from leftover)
Per-template statistics and trend detection
Real-time discovery progress
MCP server for LLM agent access

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
.github/workflows		.github/workflows
cmd/lapp		cmd/lapp
docs		docs
fixtures/events/v1		fixtures/events/v1
frontend		frontend
gen/go/lapp		gen/go/lapp
integration_test		integration_test
pkg		pkg
proto/lapp		proto/lapp
scripts		scripts
skills/lapp-log-analysis		skills/lapp-log-analysis
.env.example		.env.example
.gitignore		.gitignore
.golangci.yml		.golangci.yml
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
CONTEXT.md		CONTEXT.md
LICENSE		LICENSE
Makefile		Makefile
NOTICE		NOTICE
README.md		README.md
buf.gen.yaml		buf.gen.yaml
buf.yaml		buf.yaml
docker-compose.yml		docker-compose.yml
go.mod		go.mod
go.sum		go.sum
prek.toml		prek.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LAPP - Log Auto Pattern Pipeline

Quick Start

How It Works

Environment Variables

Commands

Event Schema

Development

Roadmap

Current

Next: Log Viewer

Future

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LAPP - Log Auto Pattern Pipeline

Quick Start

How It Works

Environment Variables

Commands

Event Schema

Development

Roadmap

Current

Next: Log Viewer

Future

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages