Skip to content

Jongmassey/port icd10 scrape#3129

Open
Jongmassey wants to merge 2 commits into
icd-10-multi-editionsfrom
Jongmassey/port-icd10-scrape
Open

Jongmassey/port icd10 scrape#3129
Jongmassey wants to merge 2 commits into
icd-10-multi-editionsfrom
Jongmassey/port-icd10-scrape

Conversation

@Jongmassey

Copy link
Copy Markdown
Contributor

Move icd browser scraper and claml converter code from bennettoxford/icd-browser-scraper into opencodelists for production scraping purposes.

The source project uses lxml for XML wrangling, but closer inspection shows the base python xml package will suffice and so this code is changed to use the latter as not to add another dependency.

Both the scraping and conversion scripts were written with the expectation of the browser scraper project structure and to be called as standalone scripts from the CLI. They are changed here such that they have main entry functions that can be easily called from other scripts as part of the ICD-10 import process.

@rw251 rw251 force-pushed the icd-10-multi-editions branch from 4f5d517 to ba46b96 Compare June 26, 2026 08:41
@Jongmassey Jongmassey force-pushed the Jongmassey/port-icd10-scrape branch from ba13092 to c655aee Compare June 29, 2026 20:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant