Exploring Demographic Language Variations to ... - Semantic Scholar

I. Explore gender bias in the use of subjective language in Twitter: investigate multilingual subjective lexical variations; cross-cultural emoticon and hashtag ...
1MB Sizes 0 Downloads 139 Views
Exploring Demographic Language Variations to Improve Multilingual Sentiment Analysis in Social Media Svitlana Volkova1 , Theresa Wilson2 and David Yarowsky1,2 , 1 Center

for Language and Speech Processing, Johns Hopkins University 2 Human-Language technology Center of Excellence

Motivation

S. Volkova, T. Wilson, D. Yarowsky (JHU)

Demographic Language Variations in Twitter

2 / 35

Motivation Demographic language variations (DLV) have been studied by socio-linguists for decades (Picard, 1997; Gefen & Ridings, 2005; Holmes & Meyerhoff, 2004; Macaulay, 2006; Tagliamonte, 2006).

S. Volkova, T. Wilson, D. Yarowsky (JHU)

Demographic Language Variations in Twitter

2 / 35

Motivation Demographic language variations (DLV) have been studied by socio-linguists for decades (Picard, 1997; Gefen & Ridings, 2005; Holmes & Meyerhoff, 2004; Macaulay, 2006; Tagliamonte, 2006). DLV have been recently explored in personal email communication, blog posts, and public discussions (Boneva et al., 2001; Mohammad & Yang, 2011; Eisenstein et al., 2010; Bamman et al., 2012)

S. Volkova, T. Wilson, D. Yarowsky (JHU)

Demographic Language Variations in Twitter

2 / 35

Motivation Demographic language variations (DLV) have been studied by socio-linguists for decades (Picard, 1997; Gefen & Ridings, 2005; Holmes & Meyerhoff, 2004; Macaulay, 2006; Tagliamonte, 2006). DLV have been recently explored in personal email communication, blog posts, and public discussions (Boneva et al., 2001; Mohammad & Yang, 2011; Eisenstein et al., 2010; Bamman et al., 2012) We propose to study differences in subjective language in social media to support commercial applications:

S. Volkova, T. Wilson, D. Yarowsky (JHU)

Demographic Language Variations in Twitter

2 / 35

Motivation Demographic language variations (DLV) have been studied by socio-linguists for decades (Picard, 1997; Gefen & Ridings, 2005; Holmes & Meyerhoff, 2004; Macaulay, 2006; Tagliamonte, 2006). DLV have been recently explored in personal email communication, blog posts, and public discussions (Boneva et al., 2001; Mohammad & Yang, 2011; Eisenstein et al., 2010; Bamman et al., 2012) We propose to study differences in subjective language in social media to support commercial applications: personalized recommendation systems and targeted online advertising (Fan & Chang, 2009),

S. Volkova, T. Wilson, D. Yarowsky (JHU)

Demographic Language Variations in Twitter

2 / 35

Motivation Demographic language variations (DLV) have been studied by socio-linguists for decades (Picard, 1997; Gefen & Ridings, 2005; Holmes & Meyerhoff, 2004; Macaulay, 2006; Tagliamonte, 2006). DLV have been recently explored in personal email communication, blog posts, and public discussions (Boneva et al., 2001; Mohammad & Yang, 2011; Eisenstein et al., 2010; Bamman et al., 2012) We propose to study differences in subjective language in social media to support commercial applications: personalized recommendation systems and targeted online advertising (Fan & Chang, 2009), detecting helpful product reviews (Ott et al., 2011),

S. Volkova, T. Wilson, D. Yarowsky (JHU)

Demographic Language Variations in Twitter

2 / 35

Motivation Demographic language variations (DLV) have been studied by socio-linguists for decades (Picard, 1997; Gefen & Ridings, 2005; Holmes & Meyerhoff, 2004; Macaulay, 2006; Tagliamonte, 2006). DLV have been recently explored in personal email communication, blog posts, and public discussions (Boneva et al., 2001; Mohammad & Yang, 2011; Eisenstein et al., 2010; Bamman et al., 2012) We propose to study differences in subjective language in social media to support commercial applications: personalized recommendation systems and targeted online advertising (Fan & Chang, 2009), detecting helpful product rev