<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet href="http://onlinelibrarystatic.wiley.com/xslt/wol-journal-rss.xsl"
            type="text/xsl"?><rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"><channel rdf:about="http://onlinelibrary.wiley.com/rss/journal/10.1111/(ISSN)1467-8640" xmlns="http://purl.org/rss/1.0/"><title>Computational Intelligence</title><description> Wiley Online Library : Computational Intelligence</description><link>http://dx.doi.org/10.1111%2F%28ISSN%291467-8640</link><dc:publisher xmlns:dc="http://purl.org/dc/elements/1.1/">John Wiley &amp; Sons, Inc</dc:publisher><dc:language xmlns:dc="http://purl.org/dc/elements/1.1/">en</dc:language><dc:rights xmlns:dc="http://purl.org/dc/elements/1.1/">© 2012 Wiley Periodicals, Inc.</dc:rights><prism:issn xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">0824-7935</prism:issn><prism:eIssn xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">1467-8640</prism:eIssn><dc:date xmlns:dc="http://purl.org/dc/elements/1.1/">2011-11-01T00:00:00-05:00</dc:date><prism:coverDisplayDate xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">November 2011</prism:coverDisplayDate><prism:volume xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">27</prism:volume><prism:number xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">4</prism:number><prism:startingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">513</prism:startingPage><prism:endingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">721</prism:endingPage><image rdf:resource="http://onlinelibrary.wiley.com/store/10.1111/coin.2011.27.issue-4/asset/cover.gif?v=1&amp;s=07883edef6b82a89f527897809b214d3714aac4d"/><items><rdf:Seq><rdf:li rdf:resource="http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00398.x"/><rdf:li rdf:resource="http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00399.x"/><rdf:li rdf:resource="http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00400.x"/><rdf:li rdf:resource="http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00401.x"/><rdf:li rdf:resource="http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00402.x"/><rdf:li rdf:resource="http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00403.x"/><rdf:li rdf:resource="http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00404.x"/><rdf:li rdf:resource="http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00405.x"/><rdf:li rdf:resource="http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00406.x"/></rdf:Seq></items></channel><item rdf:about="http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00398.x" xmlns="http://purl.org/rss/1.0/"><title>EXTRACTING BIO-MOLECULAR EVENTS FROM LITERATURE—THE BIONLP’09 SHARED TASK</title><link>http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00398.x</link><dc:title xmlns:dc="http://purl.org/dc/elements/1.1/">EXTRACTING BIO-MOLECULAR EVENTS FROM LITERATURE—THE BIONLP’09 SHARED TASK</dc:title><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jin-Dong Kim</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Tomoko Ohta</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Sampo Pyysalo</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Yoshinobu Kano</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jun’ichi Tsujii</dc:creator><dc:date xmlns:dc="http://purl.org/dc/elements/1.1/">2011-11-01T00:00:00-05:00</dc:date><dc:identifier xmlns:dc="http://purl.org/dc/elements/1.1/">doi:10.1111/j.1467-8640.2011.00398.x</dc:identifier><dc:rights xmlns:dc="http://purl.org/dc/elements/1.1/"/><dc:publisher xmlns:dc="http://purl.org/dc/elements/1.1/">John Wiley &amp; Sons, Inc.</dc:publisher><prism:doi xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">10.1111/j.1467-8640.2011.00398.x</prism:doi><prism:url xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00398.x</prism:url><prism:startingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">513</prism:startingPage><prism:endingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">540</prism:endingPage><content:encoded xmlns:content="http://purl.org/rss/1.0/modules/content/"><![CDATA[<div class="para" xmlns:ol="http://www.wiley.com/namespaces/ol/xsl-lib" xmlns="http://www.w3.org/1999/xhtml"><p>This paper presents the preparation, results and analysis of the BioNLP’09 shared task on event extraction, organized to address the automatic extraction of fine-grained information from the scientific literature on molecular biology. A representation of this information was defined taking into account both the biological and computational requirements of the task, and corpus resources manually annotated by domain experts provided to task participants. To create a basis for further progress, emphasis was placed on providing fine-grained evaluation that isolates different subtasks and allows the analysis of different aspects of the results through various evaluation criteria. In introducing this new task to the community, we made an effort to reduce the cost of participation by making common natural language processing tools, data, and evaluation methods easily accessible. The task received community-wide participation, establishing the state-of-the-art performance at fine-grained event extraction as well as allowing the identification of remaining challenges and suggesting directions for future improvements. All the resources and results of the shared task are publicly available and an online evaluation on blind test data accessible at <!--TODO: clickthrough URL--><a href="http://www-tsujii.is.s.u-tokyo.ac.jp/GENIA/SharedTask/" title="Link to external resource: http://www-tsujii.is.s.u-tokyo.ac.jp/GENIA/SharedTask/">http://www-tsujii.is.s.u-tokyo.ac.jp/GENIA/SharedTask/</a>.</p></div>]]></content:encoded><description>This paper presents the preparation, results and analysis of the BioNLP’09 shared task on event extraction, organized to address the automatic extraction of fine-grained information from the scientific literature on molecular biology. A representation of this information was defined taking into account both the biological and computational requirements of the task, and corpus resources manually annotated by domain experts provided to task participants. To create a basis for further progress, emphasis was placed on providing fine-grained evaluation that isolates different subtasks and allows the analysis of different aspects of the results through various evaluation criteria. In introducing this new task to the community, we made an effort to reduce the cost of participation by making common natural language processing tools, data, and evaluation methods easily accessible. The task received community-wide participation, establishing the state-of-the-art performance at fine-grained event extraction as well as allowing the identification of remaining challenges and suggesting directions for future improvements. All the resources and results of the shared task are publicly available and an online evaluation on blind test data accessible at http://www-tsujii.is.s.u-tokyo.ac.jp/GENIA/SharedTask/.</description></item><item rdf:about="http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00399.x" xmlns="http://purl.org/rss/1.0/"><title>EXTRACTING CONTEXTUALIZED COMPLEX BIOLOGICAL EVENTS WITH RICH GRAPH-BASED FEATURE SETS</title><link>http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00399.x</link><dc:title xmlns:dc="http://purl.org/dc/elements/1.1/">EXTRACTING CONTEXTUALIZED COMPLEX BIOLOGICAL EVENTS WITH RICH GRAPH-BASED FEATURE SETS</dc:title><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jari Björne</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Juho Heimonen</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Filip Ginter</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Antti Airola</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Tapio Pahikkala</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Tapio Salakoski</dc:creator><dc:date xmlns:dc="http://purl.org/dc/elements/1.1/">2011-11-01T00:00:00-05:00</dc:date><dc:identifier xmlns:dc="http://purl.org/dc/elements/1.1/">doi:10.1111/j.1467-8640.2011.00399.x</dc:identifier><dc:rights xmlns:dc="http://purl.org/dc/elements/1.1/"/><dc:publisher xmlns:dc="http://purl.org/dc/elements/1.1/">John Wiley &amp; Sons, Inc.</dc:publisher><prism:doi xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">10.1111/j.1467-8640.2011.00399.x</prism:doi><prism:url xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00399.x</prism:url><prism:startingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">541</prism:startingPage><prism:endingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">557</prism:endingPage><content:encoded xmlns:content="http://purl.org/rss/1.0/modules/content/"><![CDATA[<div class="para" xmlns:ol="http://www.wiley.com/namespaces/ol/xsl-lib" xmlns="http://www.w3.org/1999/xhtml"><p>We describe a system for extracting complex events among genes and proteins from biomedical literature, developed in context of the BioNLP’09 Shared Task on Event Extraction. For each event, the system extracts its text trigger, class, and arguments. In contrast to the approaches prevailing prior to the shared task, events can be arguments of other events, resulting in a nested structure that better captures the underlying biological statements. We divide the task into independent steps which we approach as machine learning problems. We define a wide array of features and in particular make extensive use of dependency parse graphs. A rule-based postprocessing step is used to refine the output in accordance with the restrictions of the extraction task. In the shared task evaluation, the system achieved an F-score of 51.95% on the primary task, the best performance among the participants. Currently, with modifications and improvements described in this article, the system achieves 52.86% F-score on Task 1, the primary task, improving on its original performance. In addition, we extend the system also to Tasks 2 and 3, gaining F-scores of 51.28% and 50.18%, respectively. The system thus addresses the BioNLP’09 Shared Task in its entirety and achieves the best performance on all three subtasks.</p></div>]]></content:encoded><description>We describe a system for extracting complex events among genes and proteins from biomedical literature, developed in context of the BioNLP’09 Shared Task on Event Extraction. For each event, the system extracts its text trigger, class, and arguments. In contrast to the approaches prevailing prior to the shared task, events can be arguments of other events, resulting in a nested structure that better captures the underlying biological statements. We divide the task into independent steps which we approach as machine learning problems. We define a wide array of features and in particular make extensive use of dependency parse graphs. A rule-based postprocessing step is used to refine the output in accordance with the restrictions of the extraction task. In the shared task evaluation, the system achieved an F-score of 51.95% on the primary task, the best performance among the participants. Currently, with modifications and improvements described in this article, the system achieves 52.86% F-score on Task 1, the primary task, improving on its original performance. In addition, we extend the system also to Tasks 2 and 3, gaining F-scores of 51.28% and 50.18%, respectively. The system thus addresses the BioNLP’09 Shared Task in its entirety and achieves the best performance on all three subtasks.</description></item><item rdf:about="http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00400.x" xmlns="http://purl.org/rss/1.0/"><title>BIO-MOLECULAR EVENT EXTRACTION WITH MARKOV LOGIC</title><link>http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00400.x</link><dc:title xmlns:dc="http://purl.org/dc/elements/1.1/">BIO-MOLECULAR EVENT EXTRACTION WITH MARKOV LOGIC</dc:title><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Sebastian Riedel</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Rune Sætre</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Hong-Woo Chun</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Toshihisa Takagi</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jun’ichi Tsujii</dc:creator><dc:date xmlns:dc="http://purl.org/dc/elements/1.1/">2011-11-01T00:00:00-05:00</dc:date><dc:identifier xmlns:dc="http://purl.org/dc/elements/1.1/">doi:10.1111/j.1467-8640.2011.00400.x</dc:identifier><dc:rights xmlns:dc="http://purl.org/dc/elements/1.1/"/><dc:publisher xmlns:dc="http://purl.org/dc/elements/1.1/">John Wiley &amp; Sons, Inc.</dc:publisher><prism:doi xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">10.1111/j.1467-8640.2011.00400.x</prism:doi><prism:url xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00400.x</prism:url><prism:startingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">558</prism:startingPage><prism:endingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">582</prism:endingPage><content:encoded xmlns:content="http://purl.org/rss/1.0/modules/content/"><![CDATA[<div class="para" xmlns:ol="http://www.wiley.com/namespaces/ol/xsl-lib" xmlns="http://www.w3.org/1999/xhtml"><p>This article presents a novel approach to event extraction from biological text using Markov Logic. It can be described by three design decisions: (1) instead of building a pipeline using local classifiers, we design and learn a joint probabilistic model over events in a sentence; (2) instead of developing specific inference and learning algorithms for our joint model, we apply Markov Logic, a general purpose Statistical Relation Learning language, for this task; (3) we represent events as relations over the token indices of a sentence, as opposed to structures that relate event entities to gene or protein mentions. In this article, we extend our original work by providing an error analysis for binding events. Moreover, we investigate the impact of different loss functions to precision, recall and F-measure. Finally, we show how to extract events of different types that share the same event clue. This extension allowed us to improve our performance our performance even further, leading to the third best scores for task 1 (in close range to the second place) and the best results for task 2 with a 14% point margin.</p></div>]]></content:encoded><description>This article presents a novel approach to event extraction from biological text using Markov Logic. It can be described by three design decisions: (1) instead of building a pipeline using local classifiers, we design and learn a joint probabilistic model over events in a sentence; (2) instead of developing specific inference and learning algorithms for our joint model, we apply Markov Logic, a general purpose Statistical Relation Learning language, for this task; (3) we represent events as relations over the token indices of a sentence, as opposed to structures that relate event entities to gene or protein mentions. In this article, we extend our original work by providing an error analysis for binding events. Moreover, we investigate the impact of different loss functions to precision, recall and F-measure. Finally, we show how to extract events of different types that share the same event clue. This extension allowed us to improve our performance our performance even further, leading to the third best scores for task 1 (in close range to the second place) and the best results for task 2 with a 14% point margin.</description></item><item rdf:about="http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00401.x" xmlns="http://purl.org/rss/1.0/"><title>EFFECTIVE BIO-EVENT EXTRACTION USING TRIGGER WORDS AND SYNTACTIC DEPENDENCIES</title><link>http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00401.x</link><dc:title xmlns:dc="http://purl.org/dc/elements/1.1/">EFFECTIVE BIO-EVENT EXTRACTION USING TRIGGER WORDS AND SYNTACTIC DEPENDENCIES</dc:title><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Halil Kilicoglu</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Sabine Bergler</dc:creator><dc:date xmlns:dc="http://purl.org/dc/elements/1.1/">2011-11-01T00:00:00-05:00</dc:date><dc:identifier xmlns:dc="http://purl.org/dc/elements/1.1/">doi:10.1111/j.1467-8640.2011.00401.x</dc:identifier><dc:rights xmlns:dc="http://purl.org/dc/elements/1.1/"/><dc:publisher xmlns:dc="http://purl.org/dc/elements/1.1/">John Wiley &amp; Sons, Inc.</dc:publisher><prism:doi xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">10.1111/j.1467-8640.2011.00401.x</prism:doi><prism:url xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00401.x</prism:url><prism:startingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">583</prism:startingPage><prism:endingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">609</prism:endingPage><content:encoded xmlns:content="http://purl.org/rss/1.0/modules/content/"><![CDATA[<div class="para" xmlns:ol="http://www.wiley.com/namespaces/ol/xsl-lib" xmlns="http://www.w3.org/1999/xhtml"><p>The scientific literature is the main source for comprehensive, up-to-date biological knowledge. Automatic extraction of this knowledge facilitates core biological tasks, such as database curation and knowledge discovery. We present here a linguistically inspired, rule-based and syntax-driven methodology for biological event extraction. We rely on a dictionary of trigger words to detect and characterize event expressions and syntactic dependency based heuristics to extract their event arguments. We refine and extend our prior work to recognize speculated and negated events. We show that heuristics based on syntactic dependencies, used to identify event arguments, extend naturally to also identify speculation and negation scope. In the BioNLP’09 Shared Task on Event Extraction, our system placed third in the Core Event Extraction Task (F-score of 0.4462), and first in the Speculation and Negation Task (F-score of 0.4252). Of particular interest is the extraction of complex regulatory events, where it scored second place. Our system significantly outperformed other participating systems in detecting speculation and negation. These results demonstrate the utility of a syntax-driven approach. In this article, we also report on our more recent work on supervised learning of event trigger expressions and discuss event annotation issues, based on our corpus analysis.</p></div>]]></content:encoded><description>The scientific literature is the main source for comprehensive, up-to-date biological knowledge. Automatic extraction of this knowledge facilitates core biological tasks, such as database curation and knowledge discovery. We present here a linguistically inspired, rule-based and syntax-driven methodology for biological event extraction. We rely on a dictionary of trigger words to detect and characterize event expressions and syntactic dependency based heuristics to extract their event arguments. We refine and extend our prior work to recognize speculated and negated events. We show that heuristics based on syntactic dependencies, used to identify event arguments, extend naturally to also identify speculation and negation scope. In the BioNLP’09 Shared Task on Event Extraction, our system placed third in the Core Event Extraction Task (F-score of 0.4462), and first in the Speculation and Negation Task (F-score of 0.4252). Of particular interest is the extraction of complex regulatory events, where it scored second place. Our system significantly outperformed other participating systems in detecting speculation and negation. These results demonstrate the utility of a syntax-driven approach. In this article, we also report on our more recent work on supervised learning of event trigger expressions and discuss event annotation issues, based on our corpus analysis.</description></item><item rdf:about="http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00402.x" xmlns="http://purl.org/rss/1.0/"><title>SYNTACTIC SIMPLIFICATION AND SEMANTIC ENRICHMENT—TRIMMING DEPENDENCY GRAPHS FOR EVENT EXTRACTION</title><link>http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00402.x</link><dc:title xmlns:dc="http://purl.org/dc/elements/1.1/">SYNTACTIC SIMPLIFICATION AND SEMANTIC ENRICHMENT—TRIMMING DEPENDENCY GRAPHS FOR EVENT EXTRACTION</dc:title><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Ekaterina Buyko</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Erik Faessler</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Joachim Wermter</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Udo Hahn</dc:creator><dc:date xmlns:dc="http://purl.org/dc/elements/1.1/">2011-11-01T00:00:00-05:00</dc:date><dc:identifier xmlns:dc="http://purl.org/dc/elements/1.1/">doi:10.1111/j.1467-8640.2011.00402.x</dc:identifier><dc:rights xmlns:dc="http://purl.org/dc/elements/1.1/"/><dc:publisher xmlns:dc="http://purl.org/dc/elements/1.1/">John Wiley &amp; Sons, Inc.</dc:publisher><prism:doi xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">10.1111/j.1467-8640.2011.00402.x</prism:doi><prism:url xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00402.x</prism:url><prism:startingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">610</prism:startingPage><prism:endingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">644</prism:endingPage><content:encoded xmlns:content="http://purl.org/rss/1.0/modules/content/"><![CDATA[<div class="para" xmlns:ol="http://www.wiley.com/namespaces/ol/xsl-lib" xmlns="http://www.w3.org/1999/xhtml"><p>In our approach to event extraction, dependency graphs constitute the fundamental data structure for knowledge capture. Two types of trimming operations pave the way to more effective relation extraction. First, we simplify the syntactic representation structures resulting from parsing by pruning informationally irrelevant lexical material from dependency graphs. Second, we enrich informationally relevant lexical material in the simplified dependency graphs with additional semantic meta data at several layers of conceptual granularity. These two aggregation operations on linguistic representation structures are intended to avoid overfitting of machine learning-based classifiers which we use for event extraction (besides manually curated dictionaries). Given this methodological framework, the corresponding <span class="smallCaps">JReX</span> system developed by the <span class="smallCaps">Julie</span>Lab Team from Friedrich-Schiller-Universität Jena (Germany) scored on 2nd rank among 24 competing teams for Task 1 in the “BioNLP’09 Shared Task on Event Extraction,” with 45.8% recall, 47.5% precision and 46.7% F1-score on all 3,182 events. In more recent experiments, based on slight modifications of <span class="smallCaps">JReX</span> and using the same data sets, we were able to achieve 45.9% recall, 57.7% precision, and 51.1% F1-score.</p></div>]]></content:encoded><description>In our approach to event extraction, dependency graphs constitute the fundamental data structure for knowledge capture. Two types of trimming operations pave the way to more effective relation extraction. First, we simplify the syntactic representation structures resulting from parsing by pruning informationally irrelevant lexical material from dependency graphs. Second, we enrich informationally relevant lexical material in the simplified dependency graphs with additional semantic meta data at several layers of conceptual granularity. These two aggregation operations on linguistic representation structures are intended to avoid overfitting of machine learning-based classifiers which we use for event extraction (besides manually curated dictionaries). Given this methodological framework, the corresponding JReX system developed by the JulieLab Team from Friedrich-Schiller-Universität Jena (Germany) scored on 2nd rank among 24 competing teams for Task 1 in the “BioNLP’09 Shared Task on Event Extraction,” with 45.8% recall, 47.5% precision and 46.7% F1-score on all 3,182 events. In more recent experiments, based on slight modifications of JReX and using the same data sets, we were able to achieve 45.9% recall, 57.7% precision, and 51.1% F1-score.</description></item><item rdf:about="http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00403.x" xmlns="http://purl.org/rss/1.0/"><title>HIGH-PRECISION BIO-MOLECULAR EVENT EXTRACTION FROM TEXT USING PARALLEL BINARY CLASSIFIERS</title><link>http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00403.x</link><dc:title xmlns:dc="http://purl.org/dc/elements/1.1/">HIGH-PRECISION BIO-MOLECULAR EVENT EXTRACTION FROM TEXT USING PARALLEL BINARY CLASSIFIERS</dc:title><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Sofie Van Landeghem</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Bernard De Baets</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Yves Van de Peer</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Yvan Saeys</dc:creator><dc:date xmlns:dc="http://purl.org/dc/elements/1.1/">2011-11-01T00:00:00-05:00</dc:date><dc:identifier xmlns:dc="http://purl.org/dc/elements/1.1/">doi:10.1111/j.1467-8640.2011.00403.x</dc:identifier><dc:rights xmlns:dc="http://purl.org/dc/elements/1.1/"/><dc:publisher xmlns:dc="http://purl.org/dc/elements/1.1/">John Wiley &amp; Sons, Inc.</dc:publisher><prism:doi xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">10.1111/j.1467-8640.2011.00403.x</prism:doi><prism:url xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00403.x</prism:url><prism:startingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">645</prism:startingPage><prism:endingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">664</prism:endingPage><content:encoded xmlns:content="http://purl.org/rss/1.0/modules/content/"><![CDATA[<div class="para" xmlns:ol="http://www.wiley.com/namespaces/ol/xsl-lib" xmlns="http://www.w3.org/1999/xhtml"><p>We have developed a machine learning framework to accurately extract complex genetic interactions from text. Employing type-specific classifiers, this framework processes research articles to extract various biological events. Subsequently, the algorithm identifies regulation events that take other events as arguments, allowing a nested structure of predictions. All predictions are merged into an integrated network, useful for visualization and for deduction of new biological knowledge.</p></div><div class="para" xmlns="http://www.w3.org/1999/xhtml"><p>In this paper, we discuss several design choices for an event-based extraction framework. These detailed studies help improving on existing systems, which is illustrated by the relative performance gain of 10% of our system compared to the official results in the recent BioNLP’09 Shared Task. Our framework now achieves state-of-the-art performance with 37.43 recall, 54.81 precision and 44.48 F-score.</p></div><div class="para" xmlns="http://www.w3.org/1999/xhtml"><p>We further present the first study of feature selection for bio-molecular event extraction from text. While producing more cost-effective models, feature selection can also lead to a better insight into the complexity of the challenge.</p></div><div class="para" xmlns="http://www.w3.org/1999/xhtml"><p>Finally, this paper tries to bridge the gap between theoretical relation extraction from text and experimental work on bio-molecular interactions by discussing interesting opportunities to employ event-based text mining tools for real-life tasks such as hypothesis generation, database curation and knowledge discovery.</p></div>]]></content:encoded><description>We have developed a machine learning framework to accurately extract complex genetic interactions from text. Employing type-specific classifiers, this framework processes research articles to extract various biological events. Subsequently, the algorithm identifies regulation events that take other events as arguments, allowing a nested structure of predictions. All predictions are merged into an integrated network, useful for visualization and for deduction of new biological knowledge.In this paper, we discuss several design choices for an event-based extraction framework. These detailed studies help improving on existing systems, which is illustrated by the relative performance gain of 10% of our system compared to the official results in the recent BioNLP’09 Shared Task. Our framework now achieves state-of-the-art performance with 37.43 recall, 54.81 precision and 44.48 F-score.We further present the first study of feature selection for bio-molecular event extraction from text. While producing more cost-effective models, feature selection can also lead to a better insight into the complexity of the challenge.Finally, this paper tries to bridge the gap between theoretical relation extraction from text and experimental work on bio-molecular interactions by discussing interesting opportunities to employ event-based text mining tools for real-life tasks such as hypothesis generation, database curation and knowledge discovery.</description></item><item rdf:about="http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00404.x" xmlns="http://purl.org/rss/1.0/"><title>MOLECULAR EVENT EXTRACTION FROM LINK GRAMMAR PARSE TREES IN THE BIONLP’09 SHARED TASK</title><link>http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00404.x</link><dc:title xmlns:dc="http://purl.org/dc/elements/1.1/">MOLECULAR EVENT EXTRACTION FROM LINK GRAMMAR PARSE TREES IN THE BIONLP’09 SHARED TASK</dc:title><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jörg Hakenberg</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Illés Solt</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Domonkos Tikk</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Võ Há Nguyên</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Luis Tari</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Quang Long Nguyen</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Chitta Baral</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Ulf Leser</dc:creator><dc:date xmlns:dc="http://purl.org/dc/elements/1.1/">2011-11-01T00:00:00-05:00</dc:date><dc:identifier xmlns:dc="http://purl.org/dc/elements/1.1/">doi:10.1111/j.1467-8640.2011.00404.x</dc:identifier><dc:rights xmlns:dc="http://purl.org/dc/elements/1.1/"/><dc:publisher xmlns:dc="http://purl.org/dc/elements/1.1/">John Wiley &amp; Sons, Inc.</dc:publisher><prism:doi xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">10.1111/j.1467-8640.2011.00404.x</prism:doi><prism:url xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00404.x</prism:url><prism:startingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">665</prism:startingPage><prism:endingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">680</prism:endingPage><content:encoded xmlns:content="http://purl.org/rss/1.0/modules/content/"><![CDATA[<div class="para" xmlns:ol="http://www.wiley.com/namespaces/ol/xsl-lib" xmlns="http://www.w3.org/1999/xhtml"><p>The BioNLP’09 Shared Task deals with extracting information on molecular events, such as gene expression and protein localization, from natural language text. Information in this benchmark are given as tuples including protein names, trigger terms for each event, and possible other participants such as bindings sites. We address all three tasks of BioNLP’09: event detection, event enrichment, and recognition of negation and speculation. Our method for the first two tasks is based on a deep parser; we store the parse tree of each sentence in a relational database scheme. From the training data, we collect the dependencies connecting any two relevant terms of a known tuple, that is, the shortest paths linking these two constituents. We encode all such linkages in a query language to retrieve similar linkages from unseen text. For the third task, we rely on a hierarchy of hand-crafted regular expressions to recognize speculation and negated events. In this paper, we added extensions regarding a post-processing step that handles ambiguous event trigger terms, as well as an extension of the query language to relax linkage constraints. On the BioNLP Shared Task test data, we achieve an overall F1-measure of 32%, 29%, and 30% for the successive Tasks 1, 2, and 3, respectively.</p></div>]]></content:encoded><description>The BioNLP’09 Shared Task deals with extracting information on molecular events, such as gene expression and protein localization, from natural language text. Information in this benchmark are given as tuples including protein names, trigger terms for each event, and possible other participants such as bindings sites. We address all three tasks of BioNLP’09: event detection, event enrichment, and recognition of negation and speculation. Our method for the first two tasks is based on a deep parser; we store the parse tree of each sentence in a relational database scheme. From the training data, we collect the dependencies connecting any two relevant terms of a known tuple, that is, the shortest paths linking these two constituents. We encode all such linkages in a query language to retrieve similar linkages from unseen text. For the third task, we rely on a hierarchy of hand-crafted regular expressions to recognize speculation and negated events. In this paper, we added extensions regarding a post-processing step that handles ambiguous event trigger terms, as well as an extension of the query language to relax linkage constraints. On the BioNLP Shared Task test data, we achieve an overall F1-measure of 32%, 29%, and 30% for the successive Tasks 1, 2, and 3, respectively.</description></item><item rdf:about="http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00405.x" xmlns="http://purl.org/rss/1.0/"><title>HIGH-PRECISION BIOLOGICAL EVENT EXTRACTION: EFFECTS OF SYSTEM AND OF DATA</title><link>http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00405.x</link><dc:title xmlns:dc="http://purl.org/dc/elements/1.1/">HIGH-PRECISION BIOLOGICAL EVENT EXTRACTION: EFFECTS OF SYSTEM AND OF DATA</dc:title><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">K. Bretonnel Cohen</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Karin Verspoor</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">
            Helen L. Johnson</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Chris Roeder</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">
            Philip V. Ogren</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">
            William A. 
            Baumgartner Jr</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Elizabeth White</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Hannah Tipney</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Lawrence Hunter</dc:creator><dc:date xmlns:dc="http://purl.org/dc/elements/1.1/">2011-11-01T00:00:00-05:00</dc:date><dc:identifier xmlns:dc="http://purl.org/dc/elements/1.1/">doi:10.1111/j.1467-8640.2011.00405.x</dc:identifier><dc:rights xmlns:dc="http://purl.org/dc/elements/1.1/"/><dc:publisher xmlns:dc="http://purl.org/dc/elements/1.1/">John Wiley &amp; Sons, Inc.</dc:publisher><prism:doi xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">10.1111/j.1467-8640.2011.00405.x</prism:doi><prism:url xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00405.x</prism:url><prism:startingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">681</prism:startingPage><prism:endingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">701</prism:endingPage><content:encoded xmlns:content="http://purl.org/rss/1.0/modules/content/"><![CDATA[<div class="para" xmlns:ol="http://www.wiley.com/namespaces/ol/xsl-lib" xmlns="http://www.w3.org/1999/xhtml"><p>We approached the problems of event detection, argument identification, and negation and speculation detection in the BioNLP’09 information extraction challenge through concept recognition and analysis. Our methodology involved using the OpenDMAP semantic parser with manually written rules. The original OpenDMAP system was updated for this challenge with a broad ontology defined for the events of interest, new linguistic patterns for those events, and specialized coordination handling. We achieved state-of-the-art precision for two of the three tasks, scoring the highest of 24 teams at precision of 71.81 on Task 1 and the highest of 6 teams at precision of 70.97 on Task 2. We provide a detailed analysis of the training data and show that a number of trigger words were ambiguous as to event type, even when their arguments are constrained by semantic class. The data is also shown to have a number of missing annotations. Analysis of a sampling of the comparatively small number of false positives returned by our system shows that major causes of this type of error were failing to recognize second themes in two-theme events, failing to recognize events when they were the arguments to other events, failure to recognize nontheme arguments, and sentence segmentation errors. We show that specifically handling coordination had a small but important impact on the overall performance of the system. The OpenDMAP system and the rule set are available at <!--TODO: clickthrough URL--><a href="http://bionlp.sourceforge.net" title="Link to external resource: http://bionlp.sourceforge.net">http://bionlp.sourceforge.net</a>.</p></div>]]></content:encoded><description>We approached the problems of event detection, argument identification, and negation and speculation detection in the BioNLP’09 information extraction challenge through concept recognition and analysis. Our methodology involved using the OpenDMAP semantic parser with manually written rules. The original OpenDMAP system was updated for this challenge with a broad ontology defined for the events of interest, new linguistic patterns for those events, and specialized coordination handling. We achieved state-of-the-art precision for two of the three tasks, scoring the highest of 24 teams at precision of 71.81 on Task 1 and the highest of 6 teams at precision of 70.97 on Task 2. We provide a detailed analysis of the training data and show that a number of trigger words were ambiguous as to event type, even when their arguments are constrained by semantic class. The data is also shown to have a number of missing annotations. Analysis of a sampling of the comparatively small number of false positives returned by our system shows that major causes of this type of error were failing to recognize second themes in two-theme events, failing to recognize events when they were the arguments to other events, failure to recognize nontheme arguments, and sentence segmentation errors. We show that specifically handling coordination had a small but important impact on the overall performance of the system. The OpenDMAP system and the rule set are available at http://bionlp.sourceforge.net.</description></item><item rdf:about="http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00406.x" xmlns="http://purl.org/rss/1.0/"><title>EXTRACTING SECONDARY BIO-EVENT ARGUMENTS WITH EXTRACTION CONSTRAINTS</title><link>http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00406.x</link><dc:title xmlns:dc="http://purl.org/dc/elements/1.1/">EXTRACTING SECONDARY BIO-EVENT ARGUMENTS WITH EXTRACTION CONSTRAINTS</dc:title><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Yutaka Sasaki</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Xinglong Wang</dc:creator><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Sophia Ananiadou</dc:creator><dc:date xmlns:dc="http://purl.org/dc/elements/1.1/">2011-11-01T00:00:00-05:00</dc:date><dc:identifier xmlns:dc="http://purl.org/dc/elements/1.1/">doi:10.1111/j.1467-8640.2011.00406.x</dc:identifier><dc:rights xmlns:dc="http://purl.org/dc/elements/1.1/"/><dc:publisher xmlns:dc="http://purl.org/dc/elements/1.1/">John Wiley &amp; Sons, Inc.</dc:publisher><prism:doi xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">10.1111/j.1467-8640.2011.00406.x</prism:doi><prism:url xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">http://dx.doi.org/10.1111%2Fj.1467-8640.2011.00406.x</prism:url><prism:startingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">702</prism:startingPage><prism:endingPage xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">721</prism:endingPage><content:encoded xmlns:content="http://purl.org/rss/1.0/modules/content/"><![CDATA[<div class="para" xmlns:ol="http://www.wiley.com/namespaces/ol/xsl-lib" xmlns="http://www.w3.org/1999/xhtml"><p>This paper describes our bio-event extraction system developed for the BioNLP 2009 Shared Task 2, with focus on its capability of extracting secondary biological event arguments from literature. Shared Task 2 is particularly interesting because when browsing literature, biologists often need to understand conditions surrounding biological events, which are usually expressed by secondary event arguments (e.g., binding sites). To achieve our goal, we take an approach that extracts <em>n</em>-ary relations from text using <em>event extraction constraints</em> automatically generated from a training corpus. Event constraints consist of sequences of trigger words and semantic roles which we automatically identify using <em>Conditional Random Fields</em> (CRFs). Unlike most other systems participating in this shared task, our system is light-weight and relies on neither external resources (e.g., Ontologies and dictionaries) nor natural language processing software (e.g., POS taggers and parsers). The official test results show that our approach performed well on extracting secondary arguments in Task 2, yielding the highest precision at 76.62% and the second highest F-measure at 43.22%.</p></div>]]></content:encoded><description>This paper describes our bio-event extraction system developed for the BioNLP 2009 Shared Task 2, with focus on its capability of extracting secondary biological event arguments from literature. Shared Task 2 is particularly interesting because when browsing literature, biologists often need to understand conditions surrounding biological events, which are usually expressed by secondary event arguments (e.g., binding sites). To achieve our goal, we take an approach that extracts n-ary relations from text using event extraction constraints automatically generated from a training corpus. Event constraints consist of sequences of trigger words and semantic roles which we automatically identify using Conditional Random Fields (CRFs). Unlike most other systems participating in this shared task, our system is light-weight and relies on neither external resources (e.g., Ontologies and dictionaries) nor natural language processing software (e.g., POS taggers and parsers). The official test results show that our approach performed well on extracting secondary arguments in Task 2, yielding the highest precision at 76.62% and the second highest F-measure at 43.22%.</description></item></rdf:RDF>
