Understanding User-Generated Content on Social Media

Nagarajan, Bala Meenakshi

2010, Doctor of Philosophy (PhD), Wright State University, Computer Science and Engineering PhD.
Over the last few years, there has been a growing public and enterprise fascination with ‘social media’ and its role in modern society. At the heart of this fascination is the ability of users to participate, collaborate, consume, create and share content via a variety of platforms such as blogs, micro-blogs, email, instant messaging services, social network services, collaborative wikis, social bookmarking sites, and multimedia sharing sites. This dissertation is devoted to understanding informal user-generated textual content on social media platforms and using the results of the analysis to build Social Intelligence Applications. The body of research presented in this thesis focuses on understanding what a piece of user-generated content is ‘About’ via two sub-goals: Named Entity Recognition and Key Phrase Extraction on informal text. In light of the poor context and informal nature of content on social media platforms, we investigate the role of contextual information from documents, domain models and the social medium to supplement and improve the reliability and performance of existing text mining algorithms for Named Entity Recognition and Key Phrase Extraction. In all cases we find that using multiple contextual cues together leads to more reliable, interdependent decisions than using the cues in isolation, and that such improvements are robust across domains and content of varying characteristics, from micro-blogs like Twitter and social networking forums such as those on MySpace and Facebook to blogs on the Web. Finally, we showcase two deployed Social Intelligence applications that build on the results of Named Entity Recognition and Key Phrase Extraction algorithms to provide near real-time information about the pulse of an online populace. Specifically, we describe what it takes to build applications that exploit the ‘wisdom of the crowds’, highlighting challenges in data collection, processing informal English text, metadata extraction and presentation of the resulting information.
Amit Sheth, PhD (Committee Chair)
John Flach, PhD (Committee Member)
Daniel Gruhl, PhD (Committee Member)
Michael Raymer, PhD (Committee Member)
Shaojun Wang, PhD (Committee Member)
Kevin Haas, MS (Committee Member)
209 p.

Recommended Citations

  • Nagarajan, B. M. (2010). Understanding User-Generated Content on Social Media [Doctoral dissertation, Wright State University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=wright1284152205

    APA Style (7th edition)

  • Nagarajan, Bala Meenakshi. Understanding User-Generated Content on Social Media. 2010. Wright State University, Doctoral dissertation. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=wright1284152205.

    MLA Style (8th edition)

  • Nagarajan, Bala Meenakshi. "Understanding User-Generated Content on Social Media." Doctoral dissertation, Wright State University, 2010. http://rave.ohiolink.edu/etdc/view?acc_num=wright1284152205

    Chicago Manual of Style (17th edition)