Please enable JavaScript.
Coggle requires JavaScript to display documents.
Large-Scale Event Extraction from Literature with Multi-Level Gene…
-
我们要做的是event link gene ontology(event normalization).
以下问题首先需要被明确:
What is event? How is it extracted and stored(event extraction都有哪些方法,extract出的event是怎样的)?
What is gene ontology?
How to connect those 2 things? Are there any methods of connection whether in Bio or general for inspiration?
使用的什么数据?All methods are run on al PubMed abstracts and PubMed Central open access full texts, resulting in a unique dataset for text mining.
The canonical forms provide a powerful way to query textual representations of events through symbol search, dealing with lexical variation of gene symbols. 所以canonical form 是用来query EVEX的?
Does EVEX database have the family data of normalized gene Ids? Yes, Ensembl, HomoloGene, Ensembl Genomes
牵强,第一个问题:GenNorm在assign gene id时是否已经用过canonical form这个信息了?如果是,这就自相矛盾了。第二个问题:万一这个family还是包含了这些multiple ID呢,这方法就黄了。
inter-species ambiguity: One name,
abbreviation or code may refer to genes in multiple species, each with its own unique ID, or even to multiple genes in the same species or across different species.
-