Introduction

It is a common assumption in social science that, as Erikson and Tedin1 put it, “Conservatives consider people inherently unequal and worthy of unequal rewards,” whereas “liberals are egalitarian” (p. 69). Generations of philosophers, social theorists, and political scientists have argued that a fundamental, if not the fundamental, difference between ideologues of the left and right concerns egalitarianism: liberal-leftists prioritize social, economic, and political forms of equality, whereas conservative-rightists accept existing forms of hierarchy and inequality as legitimate and necessary, and perhaps even desirable (e.g.,2,3,4,5,6,7,8). A stronger commitment to equality and tolerance on the part of liberals would help to explain the evidence, accumulated over several decades, that on both implicit and explicit measures political liberals express less hostility than conservatives toward a wide range of social groups that are frequent targets of prejudice in society5,8,9,10,11,12,13,14,15,16,17,18,19,20.

Recently, however, the longstanding idea that liberals are more egalitarian, more tolerant, and less prejudiced than conservatives has come under attack. It has been argued that liberal-leftists are every bit as authoritarian, intolerant, and hostile toward dissimilar others as are conservative-rightists21,22,23,24,25,26. The overarching claim is that leftists and rightists are equally biased, but they are just biased against different groups27. There is also an untested assumption in the literature on “worldview conflict” and prejudice27 that conservatives are biased against Black people and women not because of race or gender, but merely because they assume that Black people and women are liberal. Thus, whereas rightists are said to express prejudice against groups that are presumed to be left-leaning (such as Black people, atheists, and women), leftists are said to express prejudice against groups that are presumably right-leaning (such as businesspeople, Christians, and men).

The vast majority of evidence put forward on behalf of the ideological symmetry perspective is based on self-reported attitudes, such as feeling thermometer ratings of how “cold” or “warm” people feel toward specific target groups. A typical, albeit unsurprising, finding is that social conservatives feel more warmth toward groups perceived as socially conservative (vs. liberal), whereas social liberals feel more warmth toward groups perceived as socially liberal28. However, we think that there are several major problems with investigating ideological symmetries and asymmetries in prejudice this way29.

To begin with, most of the research purporting to document ideological symmetries in prejudice merely shows that liberals and conservatives sometimes express lukewarm attitudes toward specific groups. This body of work relies upon what we consider to be a watered-down definition of prejudice as any “negative evaluation... on the basis of group membership,” which “does not depend on whether such a prejudice can be justified according to some moral code” (p. 359)30. This conceptualization departs radically from “classic” definitions of prejudice in social psychology, such as Gordon Allport’s31 treatment of prejudice as “thinking ill of others without sufficient warrant” (p. 6), that is, “an antipathy based upon a faulty and inflexible generalization... directed toward a group as a whole, or toward an individual because he is a member of that group” (p. 9). Textbook definitions likewise emphasize “a hostile or negative attitude toward a distinguishable group based on generalizations derived from faulty or incomplete information” (p. 231)32, and “an unjustifiable (and usually negative) attitude toward a group and its members [involving] stereotyped beliefs, negative feelings, and a predisposition to discriminatory action” (p. G–10)33. When social scientists seek to understand and ameliorate prejudice, we expect that they are not concerned merely with the expression of lukewarm attitudes but with the kind of intense, unwarranted negative affect that motivates hostility, hatred, intimidation, and discrimination (e.g.,34).

To overcome limitations of previous research on the subject, and to investigate the hypothesis that liberal commitments to equality and democratic tolerance would contribute to an ideological asymmetry in expressions of hostility, intimidation, and prejudice, we conducted a large-scale investigation of naturally occurring social media behavior. Specifically, we harvested a large corpus of Twitter messages based on keywords that included social groups that, according to previous research, are common targets of liberal prejudice (e.g., Catholics, Whites, wealthy people, and conservatives) and conservative prejudice (e.g., Blacks, illegal immigrants, and liberals). In addition, we implemented a Bayesian Spatial Following model to estimate the ideological positions of Twitter users in our sample, so that we could compare the online behavior of left- and right-leaning social media users. Finally, we used a combination of manual and automatic text-coding methods to investigate ideological asymmetries in the use of language containing (1) threat and intimidation, (2) obscenity and vulgarity, (3) name-calling and humiliation, (4) hatred and racial, ethnic, or religious slurs, (5) stereotypic generalizations, and (6) negative prejudicial language. We hypothesized that: (HI) tweets mentioning liberal- or left-leaning target groups will contain more expressions of online prejudice than tweets mentioning conservative- or right-leaning target groups; and (HII) tweets sent by conservative- and right-leaning users will contain more expressions of online prejudice than tweets sent by liberal- and left-leaning users.

Method

Data collection and inclusion criteria

We used a supervised machine-learning approach to analyze naturally occurring language in a very large number of social media posts sent by liberal-leftists and conservative-rightists in reference to groups that have been identified as likely targets of liberal and conservative bias. The population of interest was the set of messages circulated in the U.S. Twittersphere. Between March and May 2016, we harvested 733,907 Twitter messages that included one or more of the 96 keywords listed in Table 1, including progressives, rightists, Christians, civil rights activists, Caucasians, Black people, destitute, and rich people. The selection of target groups was based on previous research by Chambers et al.23 and Brandt et al.22, which sought to specify frequent targets of “liberal prejudice” and “conservative prejudice.” For each of the target groups, we included synonyms, all of which were either hashtags or keywords used on Twitter during the period of data collection. All search terms were manually inspected prior to data collection; terms that the computer scientists implementing the queries deemed too common on Twitter were excluded from the collection. To filter out tweets containing pornographic content and tweets written in languages other than English, we included pornography and non-English as screening categories in the human coding and machine-learning phases. We then excluded tweets whose machine-learning-estimated probability of containing pornographic content exceeded 0.50, as well as those whose estimated probability of being non-English exceeded 0.50. This left us with a total sample of 670,973 tweets that were eligible for further analysis.

Table 1 Keywords used to harvest tweets for the data collection.
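To make the screening step concrete, the following is a minimal sketch, assuming the harvested tweets are stored in a table with hypothetical columns p_porn and p_non_english holding the classifier-estimated probabilities for the two screening categories; the file and column names are ours, not part of the original pipeline.

```python
import pandas as pd

# Hypothetical file of harvested tweets with one row per tweet and
# classifier-estimated probabilities for the two screening categories.
tweets = pd.read_csv("harvested_tweets.csv")

# Keep tweets whose estimated probability of being pornographic or
# non-English does not exceed 0.50 (the exclusion threshold described above).
eligible = tweets[(tweets["p_porn"] <= 0.50) & (tweets["p_non_english"] <= 0.50)]

print(f"Harvested: {len(tweets):,}; eligible for analysis: {len(eligible):,}")
```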

Ideological estimation

We used Barberá’s method of estimating left–right (or liberal–conservative) ideological positions of Twitter users36. This method, which has been validated in a number of ways, employs a Bayesian Spatial Following model that treats ideology as a latent variable estimated on the basis of following networks, that is, the set of liberal and conservative political accounts (of well-known journalists, politicians, and other political actors) that the individual follows. We were able to calculate point estimates for a total of 325,717 Twitter users. Scores ranged from −2.611 (very liberal) to 4.668 (very conservative), with a mean of 0.369 (SD = 1.724); the mean was close to the neutral point of zero, indicating that, on average, users in our sample were relatively moderate (neither strongly liberal nor strongly conservative). Using this method, 176,948 Twitter users in our sample were classified as liberal-leaning (that is, with scores below zero), and 148,769 were classified as conservative-leaning (with scores above zero).
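For reference, the core of the spatial following model can be written as follows. This is a sketch of the likelihood in the spirit of Barberá’s method36, where θi denotes the latent ideology of user i, φj the ideological position of political account j, αj and βi capture the account’s popularity and the user’s overall level of political interest, and γ is a normalizing parameter.

```latex
% Probability that Twitter user i follows political account j,
% modeled as a decreasing function of their squared ideological distance
% (sketch; notation approximates Barberá's spatial following model).
P(y_{ij} = 1) = \operatorname{logit}^{-1}\!\left( \alpha_j + \beta_i - \gamma \, (\theta_i - \varphi_j)^2 \right)
```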

Human coding phase

To train the automatic machine-learning algorithm to classify tweets, it was necessary to first have a subset of them manually coded. Before coding the tweets that were used for the machine-learning phase, all raters participated in a two-hour training session and were taught to follow the same standardized protocol (see Human Coding Manual in Supplementary Material). In the pilot coding phase, seven trained research assistants coded a batch of 1000 tweets (500 tweets per rater) to assess the appropriateness of the coding instructions. We then used their feedback to make clarifications, minor revisions, and edits to the coding manual. In the next phase, 11 trained undergraduate and graduate psychology students coded an additional set of 6000 tweets. The final sample of manually coded tweets therefore consisted of N = 7000 unique tweets, with each tweet coded by at least three independent raters.

Coding categories

To establish our coding scheme, we conducted an extensive literature search on studies of online incivility and the linguistic expression of prejudice. Incivility in online discourse is operationally defined in terms of the use of disrespectful language37,38. Disrespectful language can be broken down further into the use of obscene language and name-calling or attempts to humiliate the target of the disrespectful language. In the context of intergroup relations, incivility may also include the use of aggressive, threatening, or intimidating language. Because a main goal of our research program was to investigate ideological symmetries and asymmetries in prejudice, we estimated the prevalence of negative prejudicial language, which is underpinned by stereotypical categorical generalizations expressed in a way that renders them largely immune to counterevidence11,17,31,34,35. Thus, we sought to analyze prejudicial language directed at specific target groups that are typically perceived to be left- and right-leaning, respectively. Because our dataset was harvested before Twitter expanded its policies against hate speech and hateful conduct in late 2019, we were able to investigate hatred directed at various target groups.

Therefore, research assistants coded the tweets on all of the following dimensions: (1) Threat/intimidation: language conveying a threat to use physical violence or intimidation directed at an individual or group; (2) Obscenity: an offensive word or phrase that would be considered inappropriate in professional settings; (3) Hatred: a communication that carries no meaning other than the expression of hatred for some social group; (4) Name-calling/humiliation: language directed at an individual or group that is demeaning, insulting, mocking, or intended to create embarrassment; (5) Stereotypic generalization: false or misleading generalizations about groups expressed in a manner that renders them largely immune to counterevidence; and (6) Negative prejudice: an antipathy based on group-based generalizations, that is, an unfavorable feeling “toward a person or thing, prior to, or not based on, actual experience” (p. 6)31.

Inter-rater reliability coefficients for each of these categories are provided in the Online Supplement (Tables S.1–S.8). We used a majority voting method, so that if two or more of the three human coders agreed that a given tweet contained hatred, obscenity, prejudice, and so on, it was classified as belonging to the positive class. Coding frequencies estimated for the training data set are summarized in Table S.9 of the Supplement for each of the six theoretical categories (plus the two screening categories).
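As an illustration, here is a minimal sketch of the majority-voting rule for one category, assuming each tweet carries three binary codes (1 = category present, 0 = absent) from its three raters; the function and variable names are ours.

```python
def majority_vote(codes):
    """Return 1 (positive class) if at least two of the three raters
    marked the category as present for this tweet, else 0."""
    return int(sum(codes) >= 2)

# Example: three raters' binary codes for "hatred" on a single tweet.
print(majority_vote([1, 1, 0]))  # -> 1: classified as containing hatred
print(majority_vote([0, 1, 0]))  # -> 0: classified as not containing hatred
```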

Machine-learning phase

Training, validation, and test sets for the machine-learning phase were based on the 7000 human-coded tweets. We reserved 20% (1400) of the tweets to use as a test set to evaluate final model performance. Of the remaining 5600 tweets, approximately 20% (1100) were used for validation, leaving 4500 tweets with which to train the models. We used several different text classification strategies, including “bag of words” models such as the Support Vector Machine (SVM), neural networks such as Long Short-Term Memory (LSTM), and transfer learning techniques such as Universal Language Model Fine-Tuning (ULMFiT) and Bidirectional Encoder Representations from Transformers (BERT). We applied each of these strategies to classify the tweets along the six coding dimensions. For the sake of brevity, we report results from the best performing model, namely BERT. Detailed information about all machine-learning methods and results is provided in the Online Supplement, along with a comparative analysis of the four machine-learning models employed.
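The splitting procedure can be sketched as follows, assuming the 7000 human-coded tweets are held in a pandas DataFrame with one binary label column per coding category; the file name, column names, and random seed are hypothetical.

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# 7000 human-coded tweets, one row per tweet, with binary labels per category.
coded = pd.read_csv("human_coded_tweets.csv")

# Hold out 20% (1400 tweets) as the final test set for evaluating model performance.
train_val, test = train_test_split(coded, test_size=0.20, random_state=42)

# Reserve a further 1100 tweets of the remainder for validation,
# leaving roughly 4500 tweets for training.
train, val = train_test_split(train_val, test_size=1100, random_state=42)

print(len(train), len(val), len(test))  # -> 4500 1100 1400
```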

Bidirectional encoder representations from transformers

BERT is a state-of-the-art language representation model that is pre-trained on large unlabeled text corpora and can subsequently be fine-tuned, with a single additional output layer, for downstream classification tasks such as ours.
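To give a sense of what fine-tuning looks like in practice, here is a minimal sketch of a binary classifier for a single category (here, hatred) using the Hugging Face transformers library, continuing from the splitting sketch above; the model variant, hyperparameters, and column names are our own illustrative choices, not the exact configuration used in the study.

```python
import torch
from torch.utils.data import Dataset
from transformers import (BertForSequenceClassification, BertTokenizerFast,
                          Trainer, TrainingArguments)

class TweetDataset(Dataset):
    """Wraps tokenized tweets and binary labels (0/1) for one coding category."""
    def __init__(self, texts, labels, tokenizer, max_len=64):
        self.enc = tokenizer(texts, truncation=True, padding=True, max_length=max_len)
        self.labels = labels

    def __len__(self):
        return len(self.labels)

    def __getitem__(self, idx):
        item = {key: torch.tensor(val[idx]) for key, val in self.enc.items()}
        item["labels"] = torch.tensor(self.labels[idx])
        return item

# "train" and "val" are the DataFrames from the splitting sketch above;
# "text" and "hatred" are hypothetical column names.
tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
train_ds = TweetDataset(train["text"].tolist(), train["hatred"].tolist(), tokenizer)
val_ds = TweetDataset(val["text"].tolist(), val["hatred"].tolist(), tokenizer)

model = BertForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)
args = TrainingArguments(output_dir="bert_hatred", num_train_epochs=3,
                         per_device_train_batch_size=16)
Trainer(model=model, args=args, train_dataset=train_ds, eval_dataset=val_ds).train()
```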

Table 4 Bivariate correlations between target ideology (groups that were perceived as more conservative/rightist) and the expression of linguistic bias overall (N = 670,973 tweets).

Communicator effects

Next, we investigated the effects of user ideology on linguistic bias. This analysis was based on the subset of messages (n = 325,717) sent by users who could be classified as liberal or conservative. As shown in Table 5, conservative Twitter users were more likely than liberal Twitter users to communicate negative prejudice (r = 0.210), name-calling (r = 0.146), stereotypes (r = 0.110), and threatening language (r = 0.092), all ps < 0.001. Conservatives were slightly more likely to use hateful language (r = 0.011), whereas liberals were slightly more likely to use obscenity (r = −0.010); both of these effects were quite small but, because of the very large sample size, still significant at p < 0.001.

Table 5 Correlations between user ideology (Twitter users who were classified as more conservative/rightist) and the expression of linguistic bias, both overall and against specific target groups.

Communicator effects analyzed separately for liberal versus conservative target groups

Next, we inspected correlations between user ideology and linguistic bias directed at groups that were generally perceived to be liberal or left-leaning vs. conservative or right-leaning, respectively (see Table 5). For the subsample of tweets that mentioned liberal-leftist groups (n = 229,788), which comprised 70.5% of the tweets sent by users who could be ideologically classified, users who were classified as more conservative were more likely to express negative prejudice (r = 0.247), to engage in name-calling (r = 0.191), and to include threats (r = 0.123), stereotypes (r = 0.116), and hatred (r = 0.021), all ps < 0.001. There was no effect of user ideology on the use of obscenity (r = 0.003, p = 0.119).

For the much smaller subsample of tweets that mentioned conservative-rightist groups (n = 95,929), more liberal users were slightly more likely to express obscenity (r = −0.047) and hatred (r = −0.026), both ps < 0.001. However, for the remaining categories, conservative Twitter users were actually more likely than liberal Twitter users to express linguistic bias. That is, even when writing about groups that are generally considered to be right-leaning, conservatives were more likely to communicate negative prejudice (r = 0.118), stereotypes (r = 0.096), name-calling (r = 0.025), and threatening language (r = 0.021), all ps < 0.001.

Sensitivity analyses

We conducted additional sensitivity analyses to determine whether the results and their interpretation were affected by analytic decisions. Specifically, we re-coded the continuous estimates for linguistic bias into binary, categorical variables (< 50% probability = does not contain biased language, ≥ 50% probability = contains biased language) and conducted regression analyses. Results were very similar to those described above.
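A minimal sketch of this re-coding and regression step for one category follows, assuming a DataFrame with the continuous classifier probability (p_hatred) and a target-ideology score; the file and column names are hypothetical, and logistic regression via statsmodels is our illustrative choice for a binary outcome.

```python
import pandas as pd
import statsmodels.api as sm

# Hypothetical file of classified tweets with continuous bias probabilities
# and a target-group ideology score (higher = more conservative/rightist target).
tweets = pd.read_csv("classified_tweets.csv")

# Re-code the continuous probability into a binary indicator:
# 1 = contains biased language (p >= 0.50), 0 = does not (p < 0.50).
tweets["hatred_binary"] = (tweets["p_hatred"] >= 0.50).astype(int)

# Regress the binary indicator on target-group ideology.
X = sm.add_constant(tweets["target_ideology"])
result = sm.Logit(tweets["hatred_binary"], X).fit()
print(result.summary())  # reports b, SE(b), and the associated Wald (z) statistics
```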

Tweets that mentioned liberal-leaning groups were more likely to contain hatred (b = −0.45, SE(b) = 0.008, Wald = 2833.26), threats (b = −0.26, SE(b) = 0.006, Wald = 1697.99), obscenity (b = −0.23, SE(b) = 0.006, Wald = 1710.62), name-calling (b = −0.36, SE(b) = 0.004, Wald = 6783.19), stereotypes (b = −0.22, SE(b) = 0.004, Wald = 2480.92), and negative prejudice (b = −0.272, SE(b) = 0.003, Wald = 6386.04), all ps < 0.001. (Because target-group ideology was scored so that higher values indicate more conservative/rightist targets, negative coefficients reflect greater linguistic bias in tweets about liberal-leaning groups.)

We also compared the frequencies (percentages) of messages about various target groups that contained each type of linguistic bias. Tweets about left-leaning (vs. right-leaning) groups were again more likely to contain hatred (3.5% vs. 0.5%, χ² = 6882.25), threats (4.1% vs. 2.9%, χ² = 749.48), obscenity (5.8% vs. 2.8%, χ² = 3521.28), name-calling (10.4% vs. 4.8%, χ² = 7342.45), stereotypes (7.9% vs. 5.7%, χ² = 1254.16), and negative prejudice (15.4% vs. 10.1%, χ² = 4205.00), all ps < 0.001.
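These frequency comparisons amount to 2 × 2 chi-square tests of target-group ideology (left- vs. right-leaning) against the presence of each type of linguistic bias. A minimal sketch using scipy is shown below; the counts are placeholders for illustration, not the study's actual frequencies.

```python
from scipy.stats import chi2_contingency

# 2 x 2 contingency table: rows = target-group ideology (left-, right-leaning),
# columns = whether the tweet contained hatred (yes, no).
# The counts below are placeholders, not the study's actual frequencies.
table = [[8000, 220000],   # tweets about left-leaning groups
         [500,  95000]]    # tweets about right-leaning groups

chi2, p, dof, expected = chi2_contingency(table)
print(f"chi2 = {chi2:.2f}, df = {dof}, p = {p:.4g}")
```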

Finally, we examined whether user ideology was related to the likelihood that messages contained linguistic bias. Tweets sent by more conservative users had a higher probability of containing hateful language (b = 0.049, SE(b) = 0.008, Wald = 35.76), threats (b = 0.225, SE(b) = 0.005, Wald = 2055.62), name-calling (b = 0.210, SE(b) = 0.003, Wald = 3766.95), stereotypes (b = 0.134, SE(b) = 0.003, Wald = 1475.32), and negative prejudice (b = 0.26, SE(b) = 0.003, Wald = 9125.68), all ps < 0.001. There was no statistically significant effect of user ideology on the use of obscene language (b = −0.007, SE(b) = 0.006, Wald = 1.81, p = 0.179).

Ideology of the coders

Because we were concerned that the political orientations of the raters could bias their coding, we asked the research assistants to answer three questions about their general political orientation (“Please indicate on the scale below how liberal or conservative [in terms of your general outlook] you are”), social attitudes (“How liberal or conservative do you tend to be when it comes to social policy?”), and economic attitudes (“How liberal or conservative do you tend to be when it comes to economic policy?”). Responses could range from 1 (very liberal) to 7 (very conservative). The 8 (of 11) raters who answered these questions were liberal-leaning on average, M = 2.46 (SD = 1.05).

We examined point-biserial correlations between coders’ ideology scores and their ratings of each linguistic category under study for every batch of tweets. We found that rater ideology was unrelated to the criterion linguistic category used to train the machine-learning algorithm, i.e., hateful language (r = 0.009, p = 0.139). Rater ideology was also unrelated to the detection of threatening language in the training tweets (r = 0.011, p = 0.079). At the same time, the more conservative our raters were, the more likely they were to detect obscenity (r = 0.022, p < 0.001), whereas the more liberal our raters were, the more likely they were to detect name-calling (r = −0.028, p < 0.001), stereotypes (r = −0.136, p < 0.001), and negative prejudice (r = −0.111, p < 0.001). Thus, coder ideology was inconsistently related to the use of the various coding categories. Most importantly, rater ideology was unrelated to ratings of hatred, which served as the base linguistic model for training the other categories. It is also worth highlighting that the classification and labeling process for machine-learning training relied on majority voting, so that at least two annotators must have agreed that a tweet contained hatred, obscenity, and so on, before it was labeled as belonging to the positive class.
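For completeness, the coder-level analysis can be sketched as follows, assuming a long-format table with one row per rater-tweet-category combination containing the rater's ideology score and the binary code they assigned; the file and column names are hypothetical.

```python
import pandas as pd
from scipy.stats import pointbiserialr

# Hypothetical long-format file: one row per (rater, tweet, category),
# with the rater's ideology score (1-7) and the binary code they assigned (0/1).
ratings = pd.read_csv("rater_codes.csv")

for category in ["hatred", "threat", "obscenity", "name_calling",
                 "stereotype", "negative_prejudice"]:
    sub = ratings[ratings["category"] == category]
    r, p = pointbiserialr(sub["code"], sub["ideology"])
    print(f"{category}: r = {r:.3f}, p = {p:.4g}")
```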