- Ahmed Amrani, Yves
Kodratoff, and Oriane Matte-Tailliez.
A semi-automatic system for tagging specialized corpora.
In Advances in Knowledge Discovery and Data Mining, 8th Pacific-Asia
Conference, PAKDD 2004, volume 3056 of Lecture Notes in Computer
Science, pages 670-681. Springer, 2004.
- M. Ashburner,
C. A. Ball, J. A. Blake, D. Botstein, H. Butler, J. M. Cherry, A. P. Davis,
K. Dolinski, S. S. Dwight, J. T. Eppig, M. A. Harris, D. P. Hill,
L. Issel-Tarver, A. Kasarskis, S. Lewis, J. C. Matese, J. E. Richardson,
M. Ringwald, G. M. Rubin, and G. Sherlock.
Gene ontology: tool for the unification of biology.
Nature Genetics, 25:25-29, May 2000.
- K. Baclawski,
J. Cigna, M. M. Kokar, P. Mager, and B. Indurkhya.
Knowledge representation and indexing using the unified medical language
system.
In Proceedings of the Pacific Symposium on Biocomputing, pages
493-504, 2000.
- C. Blaschke and A. Valencia.
The frame-based module of the SUISEKI information extraction system.
IEEE Intelligent Systems, (17):14-20, 2002.
- Olivier Bodenreider and Serguei V. Pakhomov.
Exploring adjectival modification in biomedical discourse across two genres.
In Sophia Ananiadou and Jun'ichi Tsujii, editors, Proceedings of the ACL
2003 Workshop on Natural Language Processing in Biomedicine, pages
105-112, 2003.
- B. Bruijn and
J. Martin.
Getting to the (c)ore of knowledge: mining biomedical literature.
Int. Journal Medical Informatics, (67):7-18, 2002.
- R. Bunescu, R. Ge,
R.J. Kate, E.M. Marcotte, R.J. Mooney, A.K. Ramani, and Y.W. Wong.
Comparative experiments on learning information extractors for proteins and
their interactions.
Artificial Intelligence in Medicine (Special Issue on Summarization and
Information Extraction from Medical Documents), 2004.
- K. Bretonnel Cohen
and Lawrence Hunter.
Natural language processing and systems biology.
Technical report, University of Colorado School of Medicine, Denver, CO, USA,
2004.
- N. Collier,
C. Nobata, and J. Tsujii.
Extracting the names of genes and gene products with a hidden Markov model.
In Proceedings of the 18th International Conference on Computational
Linguistics (COLING'2000), Saarbrücken, Germany, July 2000.
- N. Collier,
C. Nobata, and J. Tsujii.
Automatic acquisition and classification of terminology using a tagged corpus
in the molecular biology domain.
Journal of Terminology, John Benjamins, 7(2):239-257, 2002.
- D. P. Corney, B. F.
Buxton, W. B. Langdon, and D. T. Jones.
BioRAT: Extracting biological information from full-length papers.
Bioinformatics, July 2004.
- M. Craven and
J. Kumlien.
Constructing biological knowledge bases by extracting information from text
sources.
In Proceedings of International Conference on Intelligent Systems for
Molecular Biology, pages 77-86, 1999.
- D. Cutting,
J. Kupiec, J. Pedersen, and P. Sibun.
A practical part-of-speech tagger.
In Proceedings of the Third Conference on Applied Natural Language
Processing, pages 133-140, 1992.
- S. Dickman.
Tough mining.
PLoS Biology, 1(2):144-147, 2003.
- Tomaz Erjavec,
Jin-Dong Kim, Tomoko Ohta, Yuka Tateisi, and Jun'ichi Tsujii.
Encoding biomedical
resources in TEI: The case of the GENIA corpus.
In Sophia Ananiadou and Jun'ichi Tsujii, editors, Proceedings of the ACL
2003 Workshop on Natural Language Processing in Biomedicine, pages
97-104, 2003.
- K. Franzen,
G. Eriksson, F. Olsson, L. Asker, P. Liden, and J. Coster.
Protein names: how to find them.
Int J Med Inf., 4(67):49-61, December 2002.
- C. Friedman,
P. Kra, M. Krauthammer, H. Yu, and A. Rzhetsky.
GENIES: a natural-language processing system for the extraction of molecular
pathways from journal articles.
Bioinformatics, 17(1):74-82, 2001.
- K. Fukuda,
T. Tsunoda, A. Tamura, and T. Takagi.
Toward information extraction: Identifying protein names from biological
papers.
In Proc. of the Pacific Symposium on Biocomputing, 1998.
- H. Harkema,
R. Gaizauskas, M. Hepple, A. Roberts, I. Roberts, N. Davis, and Y. Guo.
A large-scale terminology resource for biomedical text processing.
In BioLINK 2004: Linking Biological Literature, Ontologies, and
Databases, ACL, pages 53-60, 2004.
- M. Hepple.
Independence and commitment: Assumptions for rapid training and execution of
rule-based POS taggers.
In Proceedings of the 38th Annual Meeting of the Association for
Computational Linguistics (ACL-2000), pages 278-285, 2000.
- L. Hirschman,
L. Wong, J.C. Park, J. Tsujii, and C. H. Wu.
Accomplishments and challenges in literature data mining for biology.
Bioinformatics, 18(12):1553-1561, 2002.
- Jerry R. Hobbs.
Information extraction from biomedical text.
Journal of Biomedical Informatics, 2004.
- Wen-Juan Hou and
Hsin-Hsi Chen.
Enhancing performance of protein name recognizers using collocation.
In Sophia Ananiadou and Jun'ichi Tsujii, editors, Proceedings of the ACL
2003 Workshop on Natural Language Processing in Biomedicine, pages
25-32, 2003.
- Minlie Huang, Xiaoyan
Zhu, Yu Hao, Donald G. Payan, Kunbin Qu, and Ming Li.
Discovering patterns to extract protein-protein interactions from full texts.
Bioinformatics, 20(18):3604-3612, July 2004.
- B. L.
Humphreys, D. A. Lindberg, H. M. Schoolman, and G. O. Barnett.
The unified medical language system: an informatics research collaboration.
Journal of the American Medical Informatics Assoc., 5(1):1-11,
1998.
- K. Humphreys,
G. Demetriou, and R. Gaizauskas.
Two applications of information extraction to biological science journal
articles: Enzyme interactions and protein structures.
In Proceedings of the Pacific Symposium on Biocomputing
(PSB-2000), pages 505-516, January 2000.
- L. Hunter.
Artificial Intelligence and Molecular Biology, chapter Molecular
Biology for Computer Scientists.
AAAI Press, 1993.
- J. Kazama, Y. Miyao,
and J. Tsujii.
A maximum entropy tagger with unsupervised hidden Markov models.
In Proceedings of the Sixth Natural Language Processing Pacific Rim
Symposium (NLPRS2001), pages 333-340, 2001.
- J. D. Kim, T. Ohta,
Y. Tateisi, and J. Tsujii.
GENIA corpus - a semantically annotated corpus for bio-textmining.
Bioinformatics, 19(1):180-182, 2003.
- S. Kulick, A. Bies,
M. Liberman, M. Mandel, R. McDonald, M. Palmer, A. Schein, and L. Ungar.
Integrated annotation for biomedical information extraction.
In NAACL/HLT Workshop on Linking Biological Literature, Ontologies and
Databases: Tools for Users, pages 61-68, 2004.
- Ki-Joong Lee, Young-Sook
Hwang, and Hae-Chang Rim.
Two-phase biomedical NE recognition based on SVMs.
In Sophia Ananiadou and Jun'ichi Tsujii, editors, Proceedings of the ACL
2003 Workshop on Natural Language Processing in Biomedicine, pages
33-40, 2003.
- G. Leroy, H. Chen, and
J. D. Martinez.
A shallow parser based on closed-class words to capture relations in biomedical
text.
Journal of Biomedical Informatics, 36:145-158, 2003.
- Bisharah Libbus,
Halil Kilicoglu, Thomas C. Rindflesch, James G. Mork, and Alan R. Aronson.
Using natural language processing, LocusLink and the Gene Ontology to compare
OMIM to MEDLINE.
In NAACL/HLT Workshop on Linking Biological Literature, Ontologies and
Databases: Tools for Users, pages 69-76, 2004.
- D. M. McDonald,
H. Chen, H. Su, and B. B. Marshall.
Extracting gene pathway relations using a hybrid grammar: the Arizona relation
parser.
Bioinformatics, 20(18):3370-3378, July 2004.
- Alex Morgan, Lynette
Hirschman, Alexander Yeh, and Marc Colosimo.
Gene name extraction using FlyBase resources.
In Sophia Ananiadou and Jun'ichi Tsujii, editors, Proceedings of the ACL
2003 Workshop on Natural Language Processing in Biomedicine, pages
1-8, 2003.
- Hans-Michael
Müller, Eimear E. Kenny, and Paul W. Sternberg.
Textpresso: An ontology-based information retrieval and extraction system for
biological literature.
PLoS Biology, 2(11), November 2004.
- G. Nenadic,
I. Spasic, and S. Ananiadou.
Terminology-driven mining of biomedical literature.
In Proceedings of the 2003 ACM Symposium on Applied Computing,
Melbourne, Florida, pages 83-87. ACM Press, New York, NY,
USA, 2003.
- Goran
Nenadic, Simon Rice, Irena Spasic, Sophia Ananiadou, and Benjamin
Stapley.
Selecting text features for gene name classification: from documents to terms.
In Sophia Ananiadou and Jun'ichi Tsujii, editors, Proceedings of the ACL
2003 Workshop on Natural Language Processing in Biomedicine, pages
121-128, 2003.
- J. Castaño, J. Zhang,
and J. Pustejovsky.
Anaphora resolution in biomedical literature.
In International Symposium on Reference Resolution, 2002.
- T. Ohta, Y. Tateisi, and
J. D. Kim.
GENIA corpus: an annotated research abstract corpus in molecular biology
domain.
In Proceedings of the Human Language Technology Conference (HLT
2002), pages 73-77, 2002.
- H. Pearson.
Biology's name game.
Nature, (411):631-632, June 2001.
- D. Proux,
F. Rechenmann, L. Julliard, V. Pillet, and B. Jacq.
Detecting gene symbols and names in biological texts: A first step toward
pertinent information extraction.
Genome Inform Ser Workshop Genome Inform., 9:72-80, 1998.
- J. Pustejovsky, J. Castaño, and J. Zhang.
Robust relational parsing over biomedical literature: Extracting inhibit
relations.
In Proceedings of the Pacific Symposium on Biocomputing, pages
362-373, 2002.
- S. Ray and M. Craven.
Representing sentence structure in hidden Markov models for information
extraction.
In Proc. of the Int. Joint Conf. on Artificial Intelligence,
2001.
- T. C.
Rindflesch, L. Tanabe, J. N. Weinstein, and L. Hunter.
EDGAR: Extraction of drugs, genes, and relations from the biomedical
literature.
In Proc. Pacific Symposium on Biocomputing, pages 514-525,
2000.
- T. C.
Rindflesch, B. Libbus, D. Hristovski, A. R. Aronson, and H. Kilicoglu.
Semantic relations asserting the etiology of genetic diseases.
In Proceedings of the AMIA Annual Symposium, 2003.
- Barbara
Rosario and Marti Hearst.
Classifying the semantic relations in noun compounds via a domain-specific
lexical hierarchy.
In Proceedings of 2001 Conference on Empirical Methods in Natural
Language Processing, Pittsburgh, PA (EMNLP 2001), 2001.
- A. Rzhetsky,
T. Koike, S. Kalachikov, SM. Gomez, M. Krauthammer, SH. Kaplan, P. Kra, JJ.
Russo, and C. Friedman.
A knowledge model for analysis and simulation of regulatory networks.
Bioinformatics, 16(12):1120-1128, 2000.
- T. Sekimizu,
H. S. Park, and J. Tsujii.
Identifying the interaction between genes and gene products based on frequently
seen verbs in MEDLINE abstracts.
Genome Informatics, pages 62-71, 1998.
- H. Shatkay
and R. Feldman.
Mining the biomedical literature in the genomic era: An overview.
Journal of Computational Biology (JCB), 10(6):821-856, December
2003.
- Dan Shen, Jie Zhang,
Guodong Zhou, Jian Su, and Chew-Lim Tan.
Effective adaptation of hidden Markov model-based named entity recognizer for
biomedical domain.
In Sophia Ananiadou and Jun'ichi Tsujii, editors, Proceedings of the ACL
2003 Workshop on Natural Language Processing in Biomedicine, pages
49-56, 2003.
- M. Skounakis,
M. Craven, and S. Ray.
Hierarchical
Hidden Markov Models for information extraction.
In Proceedings of the 18th International Joint Conference on Artificial
Intelligence, Acapulco, Mexico. Morgan Kaufmann, 2003.
- L. Smith,
T. Rindflesch, and W. J. Wilbur.
MedPost: a part-of-speech tagger for biomedical text.
Bioinformatics, 20(14), 2004.
- Koichi
Takeuchi and Nigel Collier.
Bio-medical entity extraction using support vector machines.
In Sophia Ananiadou and Jun'ichi Tsujii, editors, Proceedings of the ACL
2003 Workshop on Natural Language Processing in Biomedicine, pages
57-64, 2003.
- L. Tanabe,
U. Scherf, L. H. Smith, J. K. Lee, L. Hunter, and J. N. Weinstein.
MedMiner: an internet text-mining tool for biomedical information, with
application to gene expression profiling.
BioTechniques, 27:1210-1217, 1999.
- Y. Tateisi and
J. Tsujii.
Part-of-speech annotation of biology research abstracts.
In Proceedings of 4th International Conference on Language Resource and
Evaluation (LREC2004), pages 1267-1270, 2004.
- Y. Tateisi,
T. Ohta, and J. Tsujii.
Annotation of predicate-argument structure of molecular biology text.
In IJCNLP-04 Workshop on Beyond Shallow Analyses, 2004.
- James Thomas, David
Milward, Christos Ouzounis, Stephen Pulman, and Mark Carroll.
Automatic extraction of protein interactions from scientific abstracts.
In Proceedings of the Pacific Symposium on Biocomputing, pages
538-549, 2000.
- Manabu Torii, Sachin
Kamboj, and K. Vijay-Shanker.
An investigation of various information sources for classifying biological
names.
In Sophia Ananiadou and Jun'ichi Tsujii, editors, Proceedings of the ACL
2003 Workshop on Natural Language Processing in Biomedicine, pages
113-120, 2003.
- Yoshimasa
Tsuruoka and Jun'ichi Tsujii.
Boosting precision and recall of dictionary-based protein name recognition.
In Sophia Ananiadou and Jun'ichi Tsujii, editors, Proceedings of the ACL
2003 Workshop on Natural Language Processing in Biomedicine, pages
41-48, 2003.
- N. Uramoto,
H. Matsuzawa, T. Nagano, A. Murakami, H. Takeuchi, and K. Takeda.
A text-mining system for knowledge discovery from biomedical documents.
IBM Systems Journal, 43(3):516-533, 2004.
- A. Yakushiji,
Y. Tateisi, Y. Miyao, and J. Tsujii.
Event extraction from biomedical papers using a full parser.
In Proceedings of the sixth Pacific Symposium on Biocomputing (PSB
2001), pages 408-419, 2001.
- Kaoru Yamamoto,
Taku Kudo, Akihiko Konagaya, and Yuji Matsumoto.
Protein name tagging for biomedical annotation in text.
In Sophia Ananiadou and Jun'ichi Tsujii, editors, Proceedings of the ACL
2003 Workshop on Natural Language Processing in Biomedicine, pages
65-72, 2003.
- H. Yu, C. Friedman,
A. Rzhetsky, and P. Kra.
Representing genomic knowledge in the UMLS semantic network.
In Proc AMIA Symp., pages 181-185, 1999.