Files
wright1284152205.pdf (7.37 MB)
Understanding User-Generated Content on Social Media
Author Info
Nagarajan, Bala Meenakshi
Permalink:
http://rave.ohiolink.edu/etdc/view?acc_num=wright1284152205
Year and Degree
2010, Doctor of Philosophy (PhD), Wright State University, Computer Science and Engineering PhD.
Abstract
Over the last few years, there has been a growing public and enterprise fascination with ‘social media’ and its role in modern society. At the heart of this fascination is the ability of users to participate, collaborate, consume, create and share content via a variety of platforms such as blogs, micro-blogs, email, instant messaging services, social network services, collaborative wikis, social bookmarking sites, and multimedia sharing sites. This dissertation is devoted to understanding informal user-generated textual content on social media platforms and using the results of the analysis to build Social Intelligence Applications. The body of research presented in this thesis focuses on understanding what a piece of user-generated content is ‘About’ via two sub-goals: Named Entity Recognition and Key Phrase Extraction on informal text. In light of the poor context and informal nature of content on social media platforms, we investigate the role of contextual information from documents, domain models and the social medium in supplementing and improving the reliability and performance of existing text mining algorithms for Named Entity Recognition and Key Phrase Extraction. In all cases we find that using multiple contextual cues together leads to reliable inter-dependent decisions, better than using the cues in isolation, and that such improvements are robust across domains and content of varying characteristics, from micro-blogs like Twitter, to social networking forums such as those on MySpace and Facebook, to blogs on the Web. Finally, we showcase two deployed Social Intelligence applications that build on the results of the Named Entity Recognition and Key Phrase Extraction algorithms to provide near real-time information about the pulse of an online populace. Specifically, we describe what it takes to build applications that exploit the ‘wisdom of the crowds’, highlighting challenges in data collection, processing informal English text, metadata extraction and presentation of the resulting information.
Committee
Amit Sheth, PhD (Committee Chair)
John Flach, PhD (Committee Member)
Daniel Gruhl, PhD (Committee Member)
Michael Raymer, PhD (Committee Member)
Shaojun Wang, PhD (Committee Member)
Kevin Haas, MS (Committee Member)
Pages
209 p.
Subject Headings
Computer Science
Keywords
social media; user-generated content; informal text analysis; domain knowledge
Citations
Nagarajan, B. M. (2010). Understanding User-Generated Content on Social Media [Doctoral dissertation, Wright State University]. OhioLINK Electronic Theses and Dissertations Center. http://rave.ohiolink.edu/etdc/view?acc_num=wright1284152205
APA Style (7th edition)
Nagarajan, Bala Meenakshi. Understanding User-Generated Content on Social Media. 2010. Wright State University, Doctoral dissertation. OhioLINK Electronic Theses and Dissertations Center, http://rave.ohiolink.edu/etdc/view?acc_num=wright1284152205.
MLA Style (8th edition)
Nagarajan, Bala Meenakshi. "Understanding User-Generated Content on Social Media." Doctoral dissertation, Wright State University, 2010. http://rave.ohiolink.edu/etdc/view?acc_num=wright1284152205
Chicago Manual of Style (17th edition)
Document number:
wright1284152205
Download Count:
3,503
Copyright Info
© 2010, all rights reserved.
This open access ETD is published by Wright State University and OhioLINK.