Skip to content

Conversation

@betsyecastro
Copy link
Contributor

@betsyecastro betsyecastro commented Jul 22, 2025

  1. Updates ORCID data source to use the V3 API, replacing the previous V2 integration for improved data accuracy and better grouping of the records.

  2. Updates criteria to search and match incoming publications from the API to existing records:

  • Allows manual editing of publications after automatic import.
  • Searches by title and publication year to find exact, partial, or similar matches.
  • Attempts to match by exact identifier (DOI or EID) first; if none are found, it then searches for the identifier within the URL.
  • If no match is found via identifier, it falls back to the initial title and year search, if any results exist.
  • If a matching publication is found, it updates the existing record with new data; otherwise, it creates a new one.
  • Implements logic to calculate the sort_order field in reverse chronological order (by year, month, and day).
  • Tracks and logs the number of publications updated, created, and marked as similar/duplicates for future cleanup.
image

@betsyecastro betsyecastro requested a review from wunc July 22, 2025 14:16
@betsyecastro betsyecastro self-assigned this Jul 22, 2025
@betsyecastro betsyecastro added the ✨ enhancement New feature or request label Jul 22, 2025
@betsyecastro betsyecastro changed the title Refactor ORCID Disambiguation Logic to Update Current Publications Refactor ORCID Disambiguation Logic to Update Existing Publications Jul 22, 2025
@betsyecastro betsyecastro force-pushed the orcid-import-disambiguation branch from 06b421d to e427b6b Compare August 4, 2025 18:44
@betsyecastro
Copy link
Contributor Author

After refactoring to group similar (non-exact match) publications and importing a more up-to-date backup, the results are as follows:
image

@betsyecastro betsyecastro added the ⬆️ priority:high High priority issue label Sep 26, 2025
@shukla-m shukla-m self-requested a review November 18, 2025 16:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

✨ enhancement New feature or request ⬆️ priority:high High priority issue

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants