Skip to content

[codex] Add Cubox platform adapters and Chinese tagging#2775

Closed
lichengtao97 wants to merge 3 commits into
karakeep-app:mainfrom
lichengtao97:codex/cubox-m1-m2
Closed

[codex] Add Cubox platform adapters and Chinese tagging#2775
lichengtao97 wants to merge 3 commits into
karakeep-app:mainfrom
lichengtao97:codex/cubox-m1-m2

Conversation

@lichengtao97
Copy link
Copy Markdown

Summary

Implements the first Cubox-alignment slice for Karakeep:

  • Adds a worker-side platform adapter framework with a WeChat adapter.
  • Routes matched URLs through adapters before falling back to the generic crawler.
  • Stores platform extraction metadata and adapter logs in the database.
  • Downloads WeChat cover and inline images into assetdb with the required Referer and rewrites article HTML to local asset URLs.
  • Adds Chinese-first AI tagging prompts using platform metadata and Chinese tag synonym normalization.
  • Keeps the existing post-crawl pipeline intact for tagging, summarization, search reindexing, webhooks, and video handling.

Validation

Focused checks run locally:

  • npx pnpm@9.15.9 --filter @karakeep/workers run format
  • npx pnpm@9.15.9 --filter @karakeep/workers run lint
  • npx pnpm@9.15.9 --filter @karakeep/workers run typecheck
  • npx pnpm@9.15.9 --filter @karakeep/workers run test
  • npx pnpm@9.15.9 --filter @karakeep/shared run format
  • npx pnpm@9.15.9 --filter @karakeep/shared run lint
  • npx pnpm@9.15.9 --filter @karakeep/shared run typecheck
  • npx pnpm@9.15.9 --filter @karakeep/shared run test
  • npx pnpm@9.15.9 --filter @karakeep/db run typecheck
  • npx pnpm@9.15.9 --filter @karakeep/trpc run typecheck

Manual verification:

  • Started local web and workers with SQLite data dir.
  • Added WeChat public account articles and verified adapter extraction, inline image asset rewriting, and AI tagging via an OpenAI-compatible Zhipu endpoint.

Notes

The full workspace typecheck still has an unrelated blocker in apps/landing: Astro requires Node >=22.12.0 while this local shell is using Node 20.17.0.

@coderabbitai
Copy link
Copy Markdown
Contributor

coderabbitai Bot commented May 9, 2026

Important

Review skipped

Draft detected.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: cb67a9da-38af-43b5-921d-44e3c878121f

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

  • 🔍 Trigger review

Tip

💬 Introducing Slack Agent: The best way for teams to turn conversations into code.

Slack Agent is built on CodeRabbit's deep understanding of your code, so your team can collaborate across the entire SDLC without losing context.

  • Generate code and open pull requests
  • Plan features and break down work
  • Investigate incidents and troubleshoot customer tickets together
  • Automate recurring tasks and respond to alerts with triggers
  • Summarize progress and report instantly

Built for teams:

  • Shared memory across your entire org—no repeating context
  • Per-thread sandboxes to safely plan and execute work
  • Governance built-in—scoped access, auditability, and budget controls

One agent for your entire SDLC. Right inside Slack.

👉 Get started


Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@MohamedBassem
Copy link
Copy Markdown
Collaborator

I'm not sure what exactly is going on here but please make sure you read the contribution guidelines of karakeep before sending PRs.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants