Skip to content
View schrodingerkitkat's full-sized avatar

Highlights

  • Pro

Block or report schrodingerkitkat

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
schrodingerkitkat/README.md

Katherine M

Lead AI Engineer · Los Angeles

Building production AI systems — from on-device ML to cloud-scale data infrastructure. Currently focused on RAG architectures, real-time video processing, and the end-to-end tooling that makes AI products actually ship.


What I Build

Consumer AI Products
iOS apps with on-device ML (CoreML, Vision) — real-time video processing, intelligent editing, and the kind of fluid UX that makes AI feel invisible.

Retrieval-Augmented Generation
Production RAG pipelines for complex document analysis — vector search, citation traceability, semantic chunking that works on messy real-world data.

Event-Driven Data Platforms
Dagster-orchestrated lakehouses on Databricks. High-velocity ingestion from social APIs. The unsexy infrastructure that makes analytics possible.

Developer Tools
CLI-native monitoring, voice-to-code interfaces, and the small sharp tools that make engineering less painful.


Selected Work

Domain Stack Notes
AI Video Platform Swift · CoreML · Vision · AWS Full-stack consumer app — on-device ML for real-time video analysis, serverless processing pipeline
Legal Discovery RAG Python · LangChain · AWS Document analysis with citation-level traceability
Social Analytics Lakehouse Databricks · Dagster · Spark Medallion architecture for real-time social metrics
Infrastructure Health TUI Go · Bubble Tea AWS monitoring for the terminal
Real-Time Social Ingestion Python · Lambda · EventBridge Sub-minute data capture from Meta APIs — the pipes that feed the lakehouse

Most repositories are private.


Stack

Languages: Python, Swift, Go, SQL, HCL
ML/AI: CoreML, Vision, LangChain, PyTorch
Infrastructure: AWS (Lambda, Step Functions, ECS), Databricks, Terraform
Data: Dagster, dbt, Spark
Learning: Rust, more Go, whatever's next


Open to interesting problems.

Popular repositories Loading

  1. quantum_etl quantum_etl Public

    QuantumETL is a cutting-edge data engineering framework that harnesses the power of quantum-inspired algorithms to optimize complex ETL (Extract, Transform, Load) workflows. By leveraging a hybrid …

    Python 1

  2. tps_parser tps_parser Public

    Read and write to .tps files

    Python

  3. boat_eta boat_eta Public

    Pyspark: DE/AE. Fetches boat locations and etas to a port and creates a map viz

    Python

  4. csv_processor csv_processor Public

    Python

  5. KoalaSis_Processor KoalaSis_Processor Public

    'Efficient Data Processing and Transformation Pipeline for KoalaSis Student Demographics and Enrollment Information

    Python

  6. TextractChemComp TextractChemComp Public

    Extracting chemical composition data from PDF documents, processing the data in AWS using boto3, s3, and Textract, and then loading it into a SQL database.

    Python