Skip to Main Content
Frequently Asked Questions
Submit an ETD
Global Search Box
Need Help?
Keyword Search
Participating Institutions
Advanced Search
School Logo
Files
File List
Jeniya_Dissertation_final.pdf (3.05 MB)
ETD Abstract Container
Abstract Header
Information Extraction From User Generated Noisy Texts
Author Info
Tabassum Binte Jafar, Jeniya
Permalink:
http://rave.ohiolink.edu/etdc/view?acc_num=osu1606315356821532
Abstract Details
Year and Degree
2020, Doctor of Philosophy, Ohio State University, Computer Science and Engineering.
Abstract
Social media websites provide an ideal environment for people to express their experiences on the latest events and share their knowledge about the current technologies along with research advancements. This presents an opportunity for Natural Language Processing (NLP) and Information Extraction (IE) technology to facilitate large scale data-analysis applications by extracting machine-processable information from user generated unstructured texts. However, information extraction from social media is particularly challenging due to the inherent noise induced by different writing styles of its users and their writing errors such as: typos and non-grammatical sentences. In this thesis, we explore the supervised and semi-supervised approaches to extract structured information from the noisy user generated texts of three widely used social web spaces: Twitter, StackOverflow and ProtocolIO.
Committee
Wei Xu (Advisor)
Alan Ritter (Advisor)
Feng Qin (Advisor)
Pages
115 p.
Subject Headings
Artificial Intelligence
;
Computer Engineering
;
Computer Science
;
Linguistics
Keywords
Natural Language Processing
;
Information Retrieval
;
Machine Learning
;
Social Media Analysis
Recommended Citations
Refworks
EndNote
RIS
Mendeley
Citations
Tabassum Binte Jafar, J. (2020).
Information Extraction From User Generated Noisy Texts
[Doctoral dissertation, Ohio State University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=osu1606315356821532
APA Style (7th edition)
Tabassum Binte Jafar, Jeniya.
Information Extraction From User Generated Noisy Texts.
2020. Ohio State University, Doctoral dissertation.
OhioLINK Electronic Theses and Dissertations Center
, http://rave.ohiolink.edu/etdc/view?acc_num=osu1606315356821532.
MLA Style (8th edition)
Tabassum Binte Jafar, Jeniya. "Information Extraction From User Generated Noisy Texts." Doctoral dissertation, Ohio State University, 2020. http://rave.ohiolink.edu/etdc/view?acc_num=osu1606315356821532
Chicago Manual of Style (17th edition)
Abstract Footer
Document number:
osu1606315356821532
Download Count:
438
Copyright Info
© 2020, all rights reserved.
This open access ETD is published by The Ohio State University and OhioLINK.