Skip to content

Conversation

@bgyori
Copy link
Member

@bgyori bgyori commented Sep 28, 2022

This PR implements an API and processor for the EVEX text mining database (http://evexdb.org/). The approach is to use the "network" relations file as the backbone of INDRA Statement extraction. However, to gather evidence text, raw agent text and coordinates, and other metadata, it is necessary to find support for each relation in raw standoff output files. This latter part is considerably complicated since network relations aren't explicitly linked to elements of the standoff output. The processor produces around 620k Statements (each with a single Evidence), and more than 99% of these contain evidence text meaning these were successfully identifier from standoff files.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant