Skip to Main Content
 

Global Search Box

 
 
 
 

ETD Abstract Container

Abstract Header

Linguistic Approach to Information Extraction and Sentiment Analysis on Twitter

Nepal, Srijan

Abstract Details

2012, MS, University of Cincinnati, Engineering and Applied Science: Computer Science.

Social media sites are one of the most popular destinations in today’s online world. With millions of users visiting social networking sites like Facebook, YouTube, Twitter etc. every day to share social content at their disposal; from simple textual information about what they are doing at any moment of time, to opinions regarding products, people, events, movies to videos and music, these sites have become massive sources of user generated content. In this work we focus on one such social networking site - Twitter, for the task of information extraction and sentiment analysis.

This work presents a linguistic framework that first performs syntactic normalization of tweets on top of traditional data cleaning, extracts assertions from each tweet in the form of binary relations, and creates a contextualized knowledge base (KB). We then present a Language Model (LM) based classifier trained on a small set of manually tagged corpus, to perform sentence level sentiment analysis on the collected assertions to eventually create a KB that is backed by sentiment values. We use this approach to implement a contextualized sentiment based yes/no question answering system.

Kenneth Berman, PhD (Committee Chair)
Fred Annexstein, PhD (Committee Member)
Anca Ralescu, PhD (Committee Member)
68 p.

Recommended Citations

Citations

  • Nepal, S. (2012). Linguistic Approach to Information Extraction and Sentiment Analysis on Twitter [Master's thesis, University of Cincinnati]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=ucin1342544962

    APA Style (7th edition)

  • Nepal, Srijan. Linguistic Approach to Information Extraction and Sentiment Analysis on Twitter. 2012. University of Cincinnati, Master's thesis. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=ucin1342544962.

    MLA Style (8th edition)

  • Nepal, Srijan. "Linguistic Approach to Information Extraction and Sentiment Analysis on Twitter." Master's thesis, University of Cincinnati, 2012. http://rave.ohiolink.edu/etdc/view?acc_num=ucin1342544962

    Chicago Manual of Style (17th edition)