Skip to Main Content
Frequently Asked Questions
Submit an ETD
Global Search Box
Need Help?
Keyword Search
Participating Institutions
Advanced Search
School Logo
Files
File List
msthesis.pdf (297.96 KB)
ETD Abstract Container
Abstract Header
Predicting Knowledge Base Revisions from Realtime Text Streams
Author Info
Konovalov, Alexander
ORCID® Identifier
http://orcid.org/0000-0002-5048-0298
Permalink:
http://rave.ohiolink.edu/etdc/view?acc_num=osu153200096922783
Abstract Details
Year and Degree
2018, Master of Science, Ohio State University, Computer Science and Engineering.
Abstract
Broad-coverage knowledge bases (KBs), such as Freebase, NELL, DBPedia, and Wikidata, Microsoft's Satori and Google's Knowledge Graph contain large collections of structured facts about things, people, places, and events happening in the world. These KBs have become increasingly important for a wide range of intelligent systems: from information retrieval and question answering, to Facebook's Graph Search, IBM's Watson, Google Home, and more. Previous work on learning to populate knowledge bases from text has, for the most part, made the simplifying assumption that facts remain constant over time. But this is inaccurate -- we live in a rapidly changing world. Knowledge should not be viewed as a static snapshot, but instead a rapidly evolving set of facts that must change as the world changes. In this thesis we demonstrate the feasibility of accurately identifying entity-transition-events from real-time news and social media text streams, that drive changes to a knowledge base. We use Wikipedia's revision history for distant supervision to learn event extractors, and evaluate the extractors based on their ability to predict online updates. Our weakly supervised event extractors are able to predict 10 KB revisions per month at 0.8 precision. By lowering our confidence threshold, we can suggest 34.3 correct edits per month at 0.4 precision. 64% of predicted edits were detected before they were added to Wikipedia. The average lead time of our forecasted knowledge revisions over Wikipedia's editors is 40 days, demonstrating the utility of our method for suggesting edits that can be quickly verified and added to the knowledge graph.
Committee
Alan Ritter (Advisor)
Wei Xu (Committee Member)
Pages
31 p.
Subject Headings
Computer Engineering
;
Computer Science
Keywords
knowledge bases
;
event extraction
;
social media
;
distant supervision
;
database revision history
Recommended Citations
Refworks
EndNote
RIS
Mendeley
Citations
Konovalov, A. (2018).
Predicting Knowledge Base Revisions from Realtime Text Streams
[Master's thesis, Ohio State University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=osu153200096922783
APA Style (7th edition)
Konovalov, Alexander.
Predicting Knowledge Base Revisions from Realtime Text Streams.
2018. Ohio State University, Master's thesis.
OhioLINK Electronic Theses and Dissertations Center
, http://rave.ohiolink.edu/etdc/view?acc_num=osu153200096922783.
MLA Style (8th edition)
Konovalov, Alexander. "Predicting Knowledge Base Revisions from Realtime Text Streams." Master's thesis, Ohio State University, 2018. http://rave.ohiolink.edu/etdc/view?acc_num=osu153200096922783
Chicago Manual of Style (17th edition)
Abstract Footer
Document number:
osu153200096922783
Download Count:
505
Copyright Info
© 2018, all rights reserved.
This open access ETD is published by The Ohio State University and OhioLINK.