All Articles

Detecting depression using digital traces on social media

social media smart phone illustrationAs seen in the fall 2025 Ivy magazine.

Ivy business researcher joins team to develop a deep learning model to detect mental health symptoms and risk factors on social media.

Social media companies could use the model to create an early warning system.

Social media is a sounding board of sorts in today’s world. It’s a place where people go to share their thoughts, opinions, and feelings – and to test the reactions of others, too. In return, they receive feedback, often in real-time, as well as support and validation.

For some individuals, sharing on-screen may also provide an easier entry point to communication, even if they aren’t fully cognizant of it, versus sharing in person or face-to-face.

“On social media, individuals often perceive a distinction between their online persona and real-world identity. Some people feel more at ease disclosing feelings like hopelessness or experiences such as divorce or job loss, due to the perceived anonymity or distance afforded by social media,” said Wenli Zhang, assistant professor of information systems and business analytics. “Those digital traces are what we want to extract.”

Zhang is part of a research team that has developed a deep learning model to detect mental health symptoms and risk factors on social media. Their study, “Depression Detection Using Digital Traces on Social Media: A Knowledge-aware Deep Learning Approach,” was published by the Journal of Management Information Systems.

Depression is one of the most common mental illnesses in the United States. According to the National Institute of Mental Health (NIMH), an estimated 21 million American adults (8.3 percent of all adults in the United States) experienced at least one major depressive episode in 2021. Of these adults, an estimated 61 percent received treatment, the NIMH reports.

“Depression and other mental health disorders are commonly underdiagnosed and consequently undertreated,” Zhang said. “Stigma may prevent a patient from answering honestly during a health screening in a doctor’s office, so with this research, our goal is to identify and offer another, complementary approach.”

Potential applications

Zhang emphasizes that the “deep knowledge-aware depression detection framework” has potential applications for individuals, as well as public health professionals, policymakers, and researchers.

Social media companies could use the model to create an early warning system, suggesting when individuals should seek help and providing them with resources, while public health professionals and policymakers could look at population-level data to determine which locations or demographics need more mental health services.

The model provides a unique opportunity to collect population-level data over time.

“For example, we could look at the past 10 years of X (formerly Twitter) data and associate it with different events such as wars, pandemics, etc. There’s no way we could get that level of data from surveys,” Zhang said.

“Other researchers have developed models to detect depression on social media. However, the ‘deep knowledge-aware depression detection framework’ differs from existing work because it compares medical terminology for depression risks and symptoms with an individual’s social media posts over time. Previous studies have looked at posts with positive or negative sentiments, which we don’t think is exact for depression detection, because someone could complain about a bad movie, bad weather, etc. However, they are not indicators of someone with depression,” Zhang said.

Zhang and her fellow team of researchers taught their model how to detect depression systems and risks using more than 1.3 million archival Reddit posts and 2,500 WebMD entries. The model can also use other social media platforms and datasets, and another study under review indicates a new version of the model can detect additional mental health disorders.

Ethical and privacy concerns

Zhang and her co-authors say using social media to detect symptoms and risk factors of chronic diseases could be a cost-efficient intervention since public posts provide a large, diverse, and free dataset. However, they recognize that there are ethical and privacy-related concerns that need to be addressed.

“Addressing the potential for abuse and ensuring the responsible use of social media-based depression detection machine learning models involves a combination of ethical considerations, legal safeguards, and technical measures,” Zhang said.

Social media platforms should prioritize informed consent when collecting data for health-related machine learning models, even if the data is anonymized. Zhang said social media companies should also ensure that data collection, storage, and usage practices comply with privacy laws and regulations.

Going forward, Zhang and her team want to expand their model to include other aspects of health, including diabetes, heart disease, and asthma. They envision incorporating photos, video, and audio from social media to capture more behavioral data.

Zhang said machine learning is not a replacement for traditional healthcare. Rather, it’s another approach to assist individuals and provide population-level data to help providers and policymakers.

December 4, 2025