Abstract
Intimate partner violence (IPV) is a significant public health problem that adversely affects the well-being of victims. IPV is often under-reported and non-physical forms of violence may not be recognized as IPV, even by victims. With the increasing popularity of social media and due to the anonymity provided by some of these platforms, people feel comfortable sharing descriptions of their relationship problems in social media. The content generated in these platforms can be useful in identifying IPV and characterizing the prevalence, causes, consequences, and correlates of IPV in broad populations. However, these descriptions are in the form of free text and no corpus of labeled data is available to perform large-scale computational and statistical analyses. Here, we use data from established questionnaires that are used to collect self-report data on IPV to train machine learning models to predict IPV from free text. Using Universal Sentence Encoder (USE) along with multiple machine learning algorithms (random forest, SVM, logistic regression, Naïve Bayes), we develop DetectIPV, a tool for detecting IPV in free text. Using DetectIPV, we comprehensively characterize the predictability of different types of violence (physical abuse, emotional abuse, sexual abuse) from free text. Our results show that a general model that is trained using examples of all violence types can identify IPV from free text with area under the ROC curve (AUROC) 89%. We also train type-specific models and observe that physical abuse can be identified with greatest accuracy (AUROC 98%), while sexual abuse can be identified with high precision but relatively low recall. While our results indicate that the prediction of emotional abuse is the most challenging, DetectIPV can identify emotional abuse with AUROC above 80%. These results establish DetectIPV as a tool that can be used to reliably detect IPV in the context of various applications, ranging from flagging social media posts to detecting IPV in large text corpuses for research purposes. DetectIPV is available as a web service at https://www.ipvlab.case.edu/ipvdetect/.
Similar content being viewed by others
References
National Center for Injury Prevention and Control, Division of Violence Prevention. (2021). Preventing intimate partner violence fact sheet. Centers for Disease Control and Prevention, Atlanta, GA. https://www.cdc.gov/violenceprevention/pdf/ipv/IPV-factsheet_2021.pdf.
Breiding, M. J., Basile, K. C., Smith, S. G., Black, M. C., Mahendra, R. R. (2015). Intimate partner violence surveillance: uniform definitions and recommended data elements, version 2.0. National Center for Injury Prevention and Control, Centers for Disease Control and Prevention, Atlanta.
World Health Organization. (2013). Global and regional estimates of violence against women: Prevalence and health effects of intimate partner violence and non-partner sexual violence. World Health Organization
Reed, L. A., Tolman, R. M., Ward, L.M. (2017). Gender matters: Experiences and consequences of digital dating abuse victimization in adolescent dating relationships. Journal of Adolescence, 59, 79-89
Velopulos, C. G., Carmichael, H., Zakrison, T. L., & Crandall, M. (2019). Comparison of male and female victims of intimate partner homicide and bidirectionality-an analysis of the national violent death reporting system. Journal of Trauma and Acute Care Surgery, 87(2), 331–336.
Puzone, C. A., Saltzman, L. E., Kresnow, M.-J., Thompson, M. P., Mercy, J. A. (2000). National trends in intimate partner homicide: United states, 1976-1995. Violence Against Women, 6(4), 409–426.
Afifi, T. O., MacMillan, H., Cox, B. J., Gordon, J., Asmundson, G., Stein, M. B., Sareen, J. (2009). Mental health correlates of intimate partner violence in marital relationships in a nationally representative sample of males and females. Journal of Interpersonal Violence, 24(8), 1398-1417
Karakurt, G., Patel, V., Whiting, K., & Koyutürk, M. (2017). Mining electronic health records data: Domestic violence and adverse health effects. Journal of Family Violence, 32(1), 79–87.
Whiting, K., Liu, L. Y., Koyutürk, M., Karakurt, G. (2017). Network map of adverse health effects among victims of intimate partner violence. In: Biocomputing 2017, page 324-335. WORLD SCIENTIFIC
Lagdon, S., Armour, C., & Stringer, M. (2014). Adult experience of mental health outcomes as a result of intimate partner violence victimisation: A systematic review. European Journal of Psychotraumatology 5(1), 24794.
Hacıaliefendioğlu, A., Yılmaz, S., Koyutürk, M., Karakurt, G. (2020). Co-occurrence patterns of intimate partner violence. In BIOCOMPUTING 2021: Proceedings of the Pacific Symposium, pages 79–90. World Scientific .
Straus, M. A. (1979). Measuring intrafamily conflict and violence: The conflict tactics (ct) scales. Journal of Marriage and the Family, 41(1), 75.
Straus, M. A., Hamby, S. L., BONEY-McCOY, S., Sugarman, D. B. (1996) The revised conflict tactics scales (cts2): Development and preliminary psychometric data. Journal of Family Issues, 17(3), 283-316.
Shepard, M., F., Campbell, J., A. (1992). The abusive behavior inventory: A measure of psychological and physical abuse. Journal of interpersonal violence, 7(3), 291–305.
Tolman, R. M. (1989). The development of a measure of psychological maltreatment of women by their male partners. Violence and Victims, 4(3), 159–177.
Smith, P.H., Earp, J.A., DeVellis, R. (1995). Measuring battering: Development of the women’s experience with battering (web) scale. Women’s Health (Hillsdale, N.J.), 1(4), 273-288.
Koss, M., P.,Gidycz, C., A. (1985). Sexual experiences survey: Reliability and validity. Journal of consulting and clinical psychology, 53(3), 422.
Chu, K.-H., Colditz, J., Malik, M., Yates, T., & Primack, B. (2019). Identifying key target audiences for public health campaigns: Leveraging machine learning in the case of hookah tobacco smoking. Journal of Medical Internet Research, 21(7), e12443.
Sarker, A., Ginn, R., Nikfarjam, A., O’Connor, K., Smith, K., Jayaraman, S., Upadhaya, T., & Gonzalez, G. (2015). Utilizing social media data for pharmacovigilance: A review. Journal of biomedical informatics, 54, 202–212.
Velasco, E., Agheneza, T., Denecke, K., Kirchner, G., & Eckmanns, T. (2014). Social media and internet-based data in global systems for public health surveillance: A systematic review. The Milbank Quarterly, 92(1), 7–33.
Laura L. S., and Neill B. B. (2014). The role of facebook in crush the crave, a mobile-and social media-based smoking cessation intervention: qualitative framework analysis of posts. Journal of medical Internet research, 16(7), e3189.
Birnbaum, M., L., Ernala, S., iranmai, R., Asra, F., Choudhury, M. De., Kane, J. M. A. (2017). Collaborative approach to identifying social media markers of schizophrenia by employing machine learning and clinical appraisals. Journal of Medical Internet Research, 19(8), e7956
Lee, H.-S., Lee, H.-R., Park, J.-U., & Han, Y.-S. (2018). An abusive text detection system based on enhanced abusive and non-abusive word lists. Decision Support Systems, 113, 22–31.
Hammer, H. L. (2014). Detecting threats of violence in online discussions using bigrams of important words. In 2014 IEEE Joint Intelligence and Security Informatics Conference, pages 319–319.
Cohen, K., Johansson, F., Kaati, L., Mork, J. C. (2014). Detecting linguistic markers for radical violence in social media. Terrorism and Political Violence, 26(1), 246-256
Warner, W., Hirschberg, J. (2012). Detecting hate speech on the world wide web. In Proceedings of the Second Workshop on Language in Social Media, pages 19–26, Montréal, Canada. Association for Computational Linguistics.
Liu, S., Forss, T. (2015). New classification models for detecting hate and violence web content. In 2015 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K), volume 01, pages 487–495.
Garimella, V. R. K., Alfayad, A., Weber, I. (2016). Social media image analysis for public health. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, CHI ’16, page 5543-5547. Association for Computing Machinery
Cer, D., Yang, Y., Kong, S.-Y., Hua, N., Limtiaco, N., St. John, R., Constant, N., Guajardo-Cespedes, M., Yuan, S., Tar, C., Strope, B., Kurzweil, R. (2018). Universal sentence encoder for English. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pages 169–174, Brussels, Belgium, November . Association for Computational Linguistics.
Campbell, J. C. (1995). Assessing dangerousness: Violence by sexual offenders, batterers, and child abusers. Sage Publications, Inc .
Hegarty, K., Sheehan, M., & Schonfeld, C. (1999). A multidimensional definition of partner abuse: Development and preliminary validation of the composite abuse scale. Journal of family violence, 14(4), 399–415.
Hegarty, K., Bush, R., & Sheehan, M. (2005). The composite abuse scale: further development and assessment of reliability and validity of a multidimensional partner abuse measure in clinical settings. Violence and victims, 20(5), 529–547.
Rodenburg, F. A., & Fantuzzo, J. W. (1993). The measure of wife abuse: Steps toward the development of a comprehensive assessment technique. Journal of Family Violence, 8(3), 203–228.
Hudson, WW. (1992) Partner abuse scale: Physical (pasph). WALMYR Assessment Scales Scoring Manual. WALMYR Publishing .
Foshee, V. A., Fletcher L., G Foshee, B., Karl E., Langwick, S. A., Arriaga, X. B., Heath, J. L., McMahon, P. M., & Bangdiwala, S. (1996). The safe dates project: Theoretical basis, evaluation design, and selected baseline findings. American journal of preventive medicine, 12(5), 39–47.
Foshee, V. A., Bauman, K. E., Arriaga, X. B., Helms, R. W., Koch, G. G., & Linder, G. F. (1998). An evaluation of safe dates, an adolescent dating violence prevention program. American journal of public health, 88(1), 45–50.
Marshall, L. L. (1992). Development of the severity of violence against women scales. Journal of family violence, 7(2), 103–121.
Sullivan, CM., Parisian, JA., Davidson, WS. (1991). Index of psychological abuse: Development of a measure. In: Poster presentation at the annual conference of the American Psychological Association.
Sullivan, C. M., & Bybee, D. I. (1999). Reducing violence using community-based advocacy for women with abusive partners. Journal of consulting and clinical psychology, 67(1), 43.
O’Leary, K. D. (1999). Psychological abuse: A variable deserving critical attention in domestic violence. Violence and victims, 14(1), 3–23.
Murphy, C. M., Hoover, S. A., Taft, C. (1999). The multidimensional measure of emotional abuse: Factor structure and subscale validity. In: Annual meeting of the Association for the Advancement of Behavior Therapy.
Sackett, L. A., & Saunders, D. G. (1999). The impact of different forms of psychological abuse on battered women. Violence and victims, 14(1), 105–117.
Tolman, R. M. (1999). The validation of the psychological maltreatment of women inventory. Violence and victims, 14(1), 25–37.
Smith, P. H., Smith, J. B., & Earp, J. A. L. (1999). Beyond the measurement trap: A reconstructed conceptualization and measurement of woman battering. Psychology of Women Quarterly, 23(1), 177–193.
Smith, P. H., Thornton, G. E., DeVellis, R., Earp, J., & Coker, A. L. (2002). A population-based study of the prevalence and distinctiveness of battering, physical assault, and sexual assault in intimate relationships. Violence against women, 8(10), 1208–1232.
Kilpatrick, D., Edmunds, C., & Seymour, A. K. (1992). The national women’s study. National Victim Center.
Resnick, H. S., Kilpatrick, D. G., Dansky, B. S., Saunders, B. E., & Best, C. L. (1993). Prevalence of civilian trauma and posttraumatic stress disorder in a representative national sample of women. Journal of consulting and clinical psychology, 61(6), 984.
Tjaden, P. & Thoennes, N. (2000). Full report of the prevalence, incidence, and consequences of violence against women: Findings from the national violence against women survey. Annotation.
Koss, M. P., & Oros, C. J. (1982). Sexual experiences survey: A research instrument investigating sexual aggression and victimization. Journal of Consulting Psychology, 50(3), 455–457.
Koss, M. P., Gidycz, C. A., & Wisniewski, N. (1987). The scope of rape: Incidence and prevalence of sexual aggression and victimization in a national sample of higher education students. Journal of consulting and clinical psychology, 55(2), 162.
Belknap, J., Fisher, B. S., & Cullen, F. T. (1999). The development of a comprehensive measure of the sexual victimization of college women. Violence Against Women, 5(2), 185–214.
Fisher, B. S., Cullen, F. T., Turner, M. G. (2000). The sexual victimization of college women. Research Report.
Busby, D. M., Christensen, C., Crane, D R., & Larson, J. H. (1995). A revision of the dyadic adjustment scale for use with distressed and nondistressed couples: Construct hierarchy and multidimensional scales. Journal of Marital and family Therapy, 21(3), 289–308.
Endler, N. S., & Parker, J. (1990). Co** inventory for stressful situations. Multi-Health systems Incorporated .
Brennan, K. A., Clark, C. L., & Shaver, P. R. (1998). Self-report measurement of adult attachment: An integrative overview. In J. A. Simpson & W. S. Rholes (Eds.), Attachment theory and close relationships (pp. 46–76). The Guilford Press.
Hamby, S. L. (1996). The dominance scale: Preliminary psychometric properties. Violence and Victims, 11(3), 199–212.
Ware, J. E. Jr., Sherbourne, C. D. (1992) The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection. Medical Care, 30(6), 473–483.
Bodenmann, G. (2008). Dyadic co** and the significance of this concept for prevention and therapy. Zeitschrift für Gesundheitspsychologie, 16(3), 108–111.
Erickson, R. J. (1993). Reconceptualizing family work: The effect of emotion work on perceptions of marital quality. Journal of Marriage and the Family, 55(4), 888–900.
Elliott, D. M., & Briere, J. (1992). Sexual abuse trauma among professional women: Validating the trauma symptom checklist-40 (tsc-40). Child abuse & Neglect, 16(3), 391–398.
Watson, D., Clark, L. A., & Tellegen, A. (1988). Development and validation of brief measures of positive and negative affect: The panas scales. Journal of personality and social psychology, 54(6), 1063.
Davis, M. H. (1983). Measuring individual differences in empathy: Evidence for a multidimensional approach. Journal of personality and social psychology, 44(1), 113.
Lambert, M. J., Burlingame, G. M., Umphress, V., Hansen, N. B., Vermeersch, D. A., Clouse, G. C., & Yanchar, S. C. (1996). The reliability and validity of the outcome questionnaire. Clinical Psychology & Psychotherapy: An International Journal of Theory and Practice, 3(4), 249–258.
Andrews, P., & Meyer, R. G. (2003). Marlowe–crowne social desirability scale and short form C: Forensic norms. Journal of clinical psychology, 59(4), 483–492.
John, O. P., Srivastava, S. (1999). The Big-Five trait taxonomy: History, measurement, and theoretical perspectives, vol 2. University of California Berkeley .
Marshall, G. N., & Hays, R. D. (1994). The patient satisfaction questionnaire short-form (PSQ-18), vol 7865. Rand Santa Monica.
Avolio, B. J., Bass, B. M., & Jung, D. I. (1999). Re-examining the components of transformational and transactional leadership using the multifactor leadership. Journal of occupational and organizational psychology, 72(4), 441–462.
Howlett, A. (2015). Predictors of academic achievement, motivation and student disengagement in university students. PhD thesis, University of Tasmania.
Lam, S-F., Jimerson, S., Wong, B. P. H., Kikas, E., Shin, H., Veiga, F. H., Hatzichristou, C., Polychroni, F., Cefai, C., & Negovan, V. (2014). Understanding and measuring student engagement in school: The results of an international study from 12 countries. School Psychology Quarterly, 29(2), 213.
Jorgensen, B. L.(2007) Financial literacy of college students: Parental and peer influences. PhD thesis, Virginia Tech .
Yiyun P., & Linda N. B. (2015). Driver’s adaptive glance behavior to in-vehicle information systems. Accident Analysis & Prevention, 85, 93–101.
Ersche, K. D., Lim, T.-V., Ward, L. H. E., Robbins, T. W., Jan, & S. (2017). Creature of habit: A self-report measure of habitual routines and automatic tendencies in everyday life. Personality and Individual Differences, 116 73–85.
British Health Foundation. Lifestyle Questionaire.
Taylor, H. L., Jacobs Jr, D. R., Schucker, B., Knudsen, J., Leon, A. S., & Debacker, G. (1978). A questionnaire for the assessment of leisure time physical activities. Journal of chronic diseases, 31(12), 741–755.
Pianta, R. C., & Steinberg, M. (1992). Teacher–child relationships and the process of adjusting to school. In R. C. Pianta (Ed.), Beyond the parent: The role of other adults in children's lives (pp. 61–80). Jossey-Bass.
Troyer, A. K., & Rich, J. B. (2002). Psychometric properties of a new metamemory questionnaire for older adults. The Journals of Gerontology Series B: Psychological Sciences and Social Sciences, 57(1), P19–P27.
Northouse, P. G. (2014). Leadership: Theory and practice. Sage publications.
Thompson, M. P., Basile, K. C., Hertz, M. F., Sitterle, D. (2006). Measuring intimate partner violence and victimization and perpetration: A compendium of assessment tools. Centers for Disease Control and Prevention, Atlanta, GA. http://www.cdc.gov/ncipc/pub-res/IPV_Compendium.pdf.
Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., Dean, J. (2013). Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems, vol 26. Curran Associates, Inc.
Wang, X., Peng, Y., Lu, L., Lu, Z., Bagheri, M., Summers, R. M. (2019). ChestX-ray: Hospital-Scale Chest X-ray Database and Benchmarks on Weakly Supervised Classification and Localization of Common Thorax Diseases, p 369-392. Advances in Computer Vision and Pattern Recognition. Springer International Publishing.
Chen, Q., Yifan P., Zhiyong L. (2020). Biosentvec: Creating sentence embeddings for biomedical texts. ar**v .
Majumder, S. B., & Das, D. (2020). Detecting fake news spreaders on twitter using universal sentence encoder. In: CLEF (Working Notes).
Asgari-Chenaghlu, M., Nikzad-Khasmakhi, N., Minaee, S. (2020). Covid-transformer: Detecting trending topics on twitter using universal sentence encoder. ar**v e-prints, pages ar**v–2009.
Iyyer, M., Manjunatha, V., Boyd-Graber, J., Hal D. III. (2015). Deep unordered composition rivals syntactic methods for text classification. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), p 1681–1691, Bei**g, China. Association for Computational Linguistics.
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., Polosukhin, I. (2017). Attention is all you need. In Advances in neural information processing systems, p 5998–6008.
Ríssola, E. A., Losada, D. E. Crestani, F. (2021). A survey of computational methods for online mental state assessment on social media. ACM Transactions on Computing for Healthcare, 2, 1–31. https://doi.org/10.1145/3437259.
Davis, A. (2014). Violence-related mild traumatic brain injury in women: Identifying a triad of postinjury disorders. Journal of Trauma Nursing|, 21(6), 300–308.
Christ, C., De Waal, M. M., Dekker, J. J. M., Kuijk, I. van, Van Schaik, D. J. F., Kikkert, M. J., & Messman-Moore, T. L. (2019). Linking childhood emotional abuse and depressive symptoms: The role of emotion dysregulation and interpersonal problems. PLoS One, 14(2). e0211882.
Follingstad, D. R. (2009). The impact of psychological aggression on women’s mental health and behavior: The status of the field. Trauma, Violence, & Abuse, 10(3), 271–289.
Engel, B. (2002). The emotionally abusive relationship: How to stop being abused and how to stop abusing. John Wiley & Sons.
Hunter, R. F., Gough, A., O’Kane, N., McKeown, G., Fitzpatrick, A., Walker, T., McKinley, M., Lee, M., & Kee, F. (2018). Ethical issues in social media research for public health. American Journal of Public Health, 108(3):343–348.
Townsend, L., & Wallace, C. (2016). Social media research: A guide to ethics. University of Aberdeen, 1, 16.
Al-Rubaie, M, & Chang, J M. (2019). Privacy-preserving machine learning: Threats and solutions. IEEE Security & Privacy, 17(2), 49–58.
Acknowledgements
This publication was made possible by US National Health Institutes (NIH) grant R01-LM012518 from the National Library of Medicine. Its contents are solely the responsibility of the authors and do not necessarily represent the official views of the NIH.
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Trinh Ha, P., D’Silva, R., Chen, E. et al. Identification of intimate partner violence from free text descriptions in social media. J Comput Soc Sc 5, 1207–1233 (2022). https://doi.org/10.1007/s42001-022-00166-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s42001-022-00166-8