Abstract
A major challenge in text mining for biomedicine is automatically extracting protein-protein interactions from the vast amount of biomedical literature. We have constructed an information extraction system based on the Hidden Vector State (HVS) model for protein-protein interactions. The HVS model can be trained using only lightly annotated data whilst simultaneously retaining sufficient ability to capture the hierarchical structure. When applied in extracting protein-protein interactions, we found that it performed better than other established statistical methods and achieved 61.5% in F-score with balanced recall and precision values. Moreover, the statistical nature of the pure data-driven HVS model makes it intrinsically robust and it can be easily adapted to other domains.
| Original language | English |
|---|---|
| Pages (from-to) | 64-80 |
| Number of pages | 17 |
| Journal | International Journal of Bioinformatics Research and Applications |
| Volume | 4 |
| Issue number | 1 |
| DOIs | |
| Publication status | Published - Feb 2008 |
Bibliographical note
International journal of bioinformatics research and applications (4, 2008) http://www.inderscience.com/offer.php?id=17164© Inderscience Enterprises Ltd.
Keywords
- information extraction
- hidden vector state model
- protein-protein interactions extraction
Fingerprint
Dive into the research topics of 'Extracting protein-protein interactions from MEDLINE using the hidden vector state model'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver