Skip Navigation
Skip to contents

STRESS : STRESS

Sumissioin : submit your manuscript
SEARCH
Search

Articles

Page Path
HOME > STRESS > Volume 34(1); 2026 > Article
Original Article
한국의 국가 정신건강 조사 체계 간 불일치: 구조화된 면접과 온라인 자가보고에 따른 유병률 추정치의 현격한 차이
이상하1orcid, 양수진2orcid
Incompatible National Mental Health Surveillance Systems in South Korea: Dramatically Different Prevalence Estimates between Structured Interviews and Brief Online Self-Reports
Sangha Lee1orcid, Su-Jin Yang2orcid
STRESS 2026;34(1):14-24.
DOI: https://doi.org/10.17547/kjsr.2026.34.1.14
Published online: March 30, 2026

1아주대학교 의과대학/아주대학교병원 정신건강의학교실 연구전담조교수

2국립정신건강센터 정신건강사업과 과장/전문의

1Research Professor, Department of Psychiatry, Ajou University School of Medicine/Ajou University Hospital, Suwon, Korea

2Director/Psychiatrist, Division of Mental Health Service, National Center for Mental Health, Seoul, Korea

Corresponding author Su-Jin Yang Division of Mental Health Service, National Center for Mental Health, 127, Yongmasan-ro, Gwangjin-gu, Seoul 04933, Korea Tel: +82-2-2204-0106 Fax: +82-2-2204-0383 E-mail: sj7512@gmail.com
• Received: December 5, 2025   • Revised: January 29, 2026   • Accepted: January 29, 2026

Copyright © 2026 Korean Society of Stress Medicine.

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

prev next
  • 245 Views
  • 14 Download
  • 한국은 면접 기반의 진단 조사와 웹 기반 자가보고 조사라는 상이한 방법론을 적용한 두 가지 주요 국가 정신건강 조사 체계를 운영하고 있다. 정확한 정책 수립을 위해서는 이 두 체계가 산출하는 추정치의 차이를 이해하는 것이 필수적이다. 본 연구는 두 국가 조사 간의 정신건강 보고 양상과 대중의 인식을 비교하였다. 2021년 정신건강실태조사(오프라인, n=5,511)와 2021년 대국민 정신건강 지식 및 태도 조사(온라인, n=2,016)의 데이터를 분석하였다. 인구통계학적 변수를 통제한 상태에서 정신장애의 평생 유병률과 인식을 비교하기 위해 로지스틱 회귀분석 및 선형 회귀분석을 실시하였다. 오프라인 응답자에 비해 온라인 응답자는 우울장애(오프라인 대 온라인 OR=0.172), 불안장애(OR=0.256), 강박장애(OR=0.019)를 보고할 가능성이 유의하게 높았다. 반면, 알코올 사용 장애(OR=1.331) 보고 가능성은 오프라인 응답자가 더 높았다. 인식 측면에서는 오프라인 응답자가 온라인 응답자에 비해 정신장애인을 고용하거나 친구로 지내는 것에 대해 더 부정적인 태도를 보였다. 두 국가 조사 체계는 상당히 다른 유병률 추정치와 인식 양상을 나타냈다. 이러한 차이는 설문 방식(익명성 대 면접자를 통한 대면), 평가 도구(자가보고 대 구조화된 진단 면접), 그리고 이들 간의 상호작용이 복합적으로 반영된 결과로 보인다. 정책 입안자는 이러한 방법론적 차이를 고려하여 데이터를 해석해야 하며, 향후 설문 방식에 따른 효과를 명확히 규명하기 위해 서로 다른 방식에서 동일한 도구를 적용해보는 후속 연구가 필요하다.
  • Background
    South Korea operates two major national mental health surveillance systems that employ different methodologies: an interview-based diagnostic survey and a web-based self-report survey. Understanding how these systems yield different estimates is essential for accurate policy planning. This study compared mental health reporting patterns and public perceptions between these two national surveys.
  • Methods
    We analyzed data from the National Mental Health Survey 2021 (offline, n=5,511) and National Mental Health Literacy and Attitude Survey 2021 (online, n=2,016). Logistic and linear regression analyses were performed to compare the lifetime prevalence rates of mental disorders and their perceptions, controlling for demographic variables.
  • Results
    Compared with offline respondents, online respondents were significantly more likely to report depression, anxiety, and obsessive-compulsive disorder. Conversely, offline respondents were more likely to report alcohol use disorder. Offline respondents exhibited more negative attitudes toward hiring or befriending individuals with mental disorders than did online respondents.
  • Conclusions
    The two national surveillance systems yielded substantially different prevalence estimates and perception patterns. These differences likely reflect a combination of survey modes (anonymity vs interviewer presence), assessment instruments (structured diagnostic interview vs self-report), and their interactions. Policymakers should interpret data from these systems with an awareness of their methodological differences. Future research using identical instruments across modes is required to isolate mode-specific differences.
Mental health issues are emerging as major public health concerns in modern society, and accurate data collection is essential for devising effective policy interventions. However, because mental health is a sensitive topic, respondents may not accurately report their actual conditions owing to social desirability bias [1-3]. This bias can undermine the reliability of survey results and impede the development of effective policies. Therefore, surveys on the status and perception of mental health require methodological approaches that ensure respondent’ anonymity and encourage honest responses.
Widespread accessibility of the Internet has revolutionized global survey methodologies. In 2024, the global Internet penetration rate was 68%, and in South Korea, it exceeded 98% [4]. Web-based surveys are efficient in terms of time and cost and provide participants with anonymity, which increases the likelihood of obtaining honest responses to sensitive questions. Consequently, Internet surveys have become a vital tool in fields such as mental health, social attitudes, and political awareness.
Despite these advantages, online surveys pose challenges, particularly in terms of data quality and respondent representativeness. Selection bias, response validity, and digital accessibility disparities are significant concerns that must be addressed when interpreting online survey data. Compared with traditional paper-and-pen surveys, online surveys offer the advantages of faster and cheaper respondent recruitment and data collection. These features render them particularly useful in studies targeting large populations. For example, Internet-based surveys enable data collection across diverse cultural contexts and geographical regions unrestricted by physical boundaries, making them cost-effective.
However, selection bias is a significant issue in such surveys [5]. In online surveys, respondent pools may be limited by factors such as Internet access, technological accessibility, and digital literacy. This limitation often results in lower response rates among digitally vulnerable groups, such as older adults or low-income populations. Consequently, the representativeness of the data may be compromised, raising concerns about the reliability and generalizability of the survey’s findings [6].
Additionally, online surveys may face challenges regarding the truthfulness of responses to sensitive topics, which may vary depending on the level of anonymity provided. This issue is particularly prominent in surveys addressing sensitive subjects, such as mental health [7]. Sensitive survey items are often associated with high nonresponse rates and increased response measurement errors [8], particularly when the survey topic involves social desirability or political correctness [9]. Rather than revealing their true opinions or experiences, respondents may respond in ways they perceive as socially acceptable. This tendency is particularly evident in topics such as income levels, sexual behaviors, and religious beliefs, ultimately reducing data accuracy and validity [10].
Some studies have suggested that online surveys elicit more honest responses to sensitive questions than offline surveys do [11]. The anonymity offered by online surveys likely plays a critical role, as respondents can answer questions without concerns of being observed or judged [12]. However, these findings are not universally applicable, and the results may vary depending on the subject matter and survey question type.
Mental health surveys are gaining importance in South Korea. Social stigma and desirability biases often make obtaining accurate data for mental health research challenging. Thus, analyzing how online and offline survey methods influence mental health survey outcomes is an important research topic. Additionally, examining the effects of demographic factors such as sex, age, and occupation on mental health perceptions in South Korea is crucial for both academic advancement and practical policy recommendations.
South Korea currently operates two distinct national mental health surveillance systems: the National Mental Health Survey, which employs face-to-face interviews using the structured Korean Composite International Diagnostic Interview (K-CIDI), and the National Mental Health Literacy and Attitude Survey, which uses web-based self-report questionnaires. These systems differ not only in survey mode (interviewer-administered vs self-administered online) but also in their assessment instruments and diagnostic thresholds. Understanding how these operationally distinct systems yield different prevalence estimates and perception data is crucial for policymakers who must synthesize information from multiple sources.
This study aimed to systematically compare the mental health reporting patterns and public perceptions between these two national surveys. Rather than claiming to isolate a pure “survey mode effect,” which would require identical instruments administered across different modes, we examined how the overall survey context (encompassing mode, instrument, and setting) is associated with different reporting patterns. Additionally, we explored whether demographic factors, such as sex, age, and occupation, were associated with mental health perceptions across both surveys. Through this comparison, we sought to provide practical guidance for interpreting national mental health statistics and inform future methodological improvements in mental health surveillance.
1. Participants

1) National Mental Health Survey 2021, for offline data

The National Mental Health Survey is a mental health study of adults residing in South Korea (aged 18~79 years), conducted every five years, based on Article 10 of the Mental Health Welfare Act. Its purpose is to provide foundational data for the development of mental health policies by identifying the prevalence and risk factors for mental disorders, their sociodemographic distribution, related factors, and mental health awareness and attitudes [13]. Because this study focused on the relationship between individuals’ sociodemographic characteristics and their perceptions of and attitudes toward mental health, it analyzed data from 5,511 participants in the National Mental Health Survey from June to August 2021. This survey used face-to-face interviews conducted by trained interviewers using the K-CIDI to diagnose and analyze the presence or absence of psychiatric disorders. Additionally, supplementary instruments were used to identify perceptions, attitudes, and various related factors using additional survey items [14]. In-person interviews were conducted by visiting the selected households using tablet PCs equipped with diagnostic tools. Self-administered questionnaires were used for some tools.

2) National Mental Health Literacy and Attitude Survey 2021, for online data

We also used data from the National Mental Health Literacy and Attitudes 2021 survey conducted by the Ministry of Health and Welfare. The survey sample consisted of 2,016 citizens aged 15~69 years residing across South Korea. Participants were selected using proportional allocation based on population statistics by sex, age, region, and administrative district. The survey was conducted online from June 18 to 30, 2021. The survey gathered information on the participants’ sociodemographic characteristics, mental health literacy, and current status regarding mental health issues [15]. It employed a case vignette-based mental health perception questionnaire developed in 2021 [16].
Table 1 presents the demographic characteristics of the study participants, including age, sex, occupation, educational level, and income. Age was categorized into six groups: teenagers and adults in their 20s, 30s, 40s, 50s, and 60s or older. Occupation was classified into 13 categories based on the Korean Standard Occupational Classification system. Formal education was divided into five levels: elementary school, middle school, high school, university graduate, and graduate school or higher. Monthly income was classified into eight categories.
2. Survey questions
The two surveys used slightly different formats to assess mental health disorders. In the offline survey, structured diagnostic tools were employed, and participants were asked specific questions, including, “During your life, have you ever felt sad, empty, or depressed almost all day, every day, for more than two weeks?” (depressive disorder); “Have you ever experienced excessive or persistent worry?” (anxiety disorder); “Have you ever seriously thought about committing suicide?” (suicidal thoughts); “Have you ever been bothered by unwanted, persistent thoughts? For instance, have you had obsessive concerns about cleanliness, germs, or other intrusive thoughts?” (obsessive-compulsive disorder [OCD]); and “During your life, have you frequently been intoxicated or impaired to the extent that it interfered with school, work, or home life?” (alcohol use disorder [AUD]). Conversely, the online survey relied on simple yes/no questions designed to address the same mental health conditions. Participants were asked whether they had experienced depressive feelings lasting several days, anxiety lasting several days, suicidal thoughts, alcohol-related problems, uncontrolled obsessive thoughts, or compulsive behaviors, such as repetitive hand washing or checking.
Although the original K-CIDI items allow the derivation of both lifetime and 12-month diagnoses, and the online items reference experiences within the past 12 months, differences in recency cannot be perfectly harmonized. Therefore, we analyzed all responses as binary “ever” experience indicators, acknowledging that residual timeframe differences may remain. We acknowledge that the two surveys employed different assessment instruments. The offline survey used the K-CIDI, a structured diagnostic interview, whereas the online survey used symptom-based self-reporting questions. This methodological difference is a limitation of the present study.
3. Statistical analysis
Logistic regression analyses were conducted to investigate the differences in the lifetime prevalence rates of mental disorders between online and offline respondents. The primary aim of these analyses was to determine whether significant differences existed between the two groups while controlling for age, sex, income, and education levels as covariates. The dependent variables in the analyses were lifetime prevalence rates of mental disorders, including depression, anxiety, OCD, AUD, and suicidal thoughts. The key independent variable was the respondent group, categorized as either offline (Group 1) or online (Group 2). The regression models were controlled for covariates, including age (as a continuous variable), sex (male or female), income level, and education level (both as categorical variables).
Descriptive statistics were first calculated to summarize the demographic characteristics of the two groups and explore their distributions. Logistic regression analyses were performed in two stages. In the univariate analyses, the independent effects of each covariate and group variable on lifetime prevalence rates were assessed. In the multivariable analyses, all covariates were included in a single model to evaluate the independent contributions of the group variables while accounting for potential confounding effects. The significance of each predictor was assessed using Wald Z-tests, with p <.05 considered statistically significant. The regression results for each disorder were reported using estimated coefficients, standard errors (SEs), and p-values to quantify the differences between groups. The model fit was evaluated using the -2 log likelihood and Akaike Information Criterion to ensure robustness.
The analyses were conducted using Python’s “statsmodels” package for logistic regression, with data preparation performed using Pandas. Matplotlib and Seaborn were used to visualize the results. Given multiple comparisons across the five mental health outcomes and four perception variables, we applied the Benjamini-Hochberg procedure to control the false discovery rate (FDR) at 0.05. Both unadjusted and FDR-adjusted p-values were reported. Because the publicly available microdata for both surveys did not include the design weights required for population-level estimation, unweighted data were used for all analyses. Consequently, the results reflect the characteristics of the sample rather than those of the entire population, and model-based SEs were reported.
4. Ethical considerations and approvals
This study was approved by the Institutional Review Board of the National Center for Mental Health (IRB No. 116271-2024-04).
The logistic regression analysis results comparing the lifetime prevalence rates of depression, anxiety, OCD, AUD, and suicidal thoughts between the two groups are presented in Table 2. These analyses were conducted to evaluate whether significant differences existed between online and offline respondents while controlling for age, sex, income, education level, and occupation. For each disorder, regression coefficients, SEs, odds ratios (ORs), and p-values were used to quantify group differences. The dependent variables were the presence of depressive or anxiety disorders, OCD, AUD, or suicidal thoughts (no=1, yes=2). The predictor variable was the group (online vs offline), with age, sex, occupation, education, and income included as control variables.
For depressive disorder, the coefficient of the group variable (offline vs online) was -1.761 (SE=0.085, p<.001). The corresponding OR of 0.172 indicated that offline respondents had 82.8% lower odds of reporting depression than online respondents (95% CI, 0.145~0.200). Equivalently, online respondents had approximately 5.81 times higher odds (1/0.172). For anxiety disorders, the coefficient of the group variable was -1.363 (SE=0.089, p<.001). An OR of 0.256 indicated that offline respondents had 74.4% lower odds of reporting anxiety than online respondents (equivalently, online respondents had approximately 3.91 times higher odds).
For AUD, the analysis showed that the coefficient of the group variable was 0.286 (SE=0.094, p=.002), indicating that offline respondents had significantly higher odds of reporting AUD than online respondents. The OR for the group variable was 1.331, indicating that offline respondents were 33.1% more likely to report AUD than their online counterparts. The intercept was −2.249 (SE=0.580, p<.001), representing the baseline log odds of AUD when all predictors were at their reference levels. This pattern (higher AUD reporting in offline surveys) was the opposite of that observed for internalizing disorders.
For OCD, the coefficient of the group variable was −3.95 (SE=0.262, p<.001), indicating that online respondents were significantly more likely to report OCD than offline respondents. An OR of 0.019 indicated that offline respondents had 98.1% lower odds of reporting OCD than online respondents (equivalently, online respondents had approximately 52.6 times higher odds).
Finally, for suicidal thoughts, the coefficient of the group variable (offline vs online) was 0.13 (SE=0.093, p=.163). Although the OR was 1.139, suggesting slightly higher odds for offline respondents, this difference was not statistically significant. The difference between groups was not statistically significant. After applying the Benjamini-Hochberg correction for multiple comparisons, all primary findings for depression, anxiety, OCD, and AUD remained statistically significant (adjusted p<.05). The nonsignificant result for suicidal thoughts remained unchanged even after correction.
A series of linear regression analyses were conducted to examine the differences in the perceptions of individuals with mental disorders between online (Group 1) and offline respondents (Group 2). Each analysis focused on a specific aspect of public attitudes toward mental disorders, with the group (online vs offline) as the primary independent variable and age, sex, occupation, education level, and income as control variables.
The first analysis explored attitudes toward forming friendships with individuals with mental disorders. The dependent variable measured the level of agreement with the statement, “I cannot be friends with someone who has a mental disorder,” with higher values indicating more negative perceptions. The regression coefficient of the group variable (offline vs online) was 0.031 (p=.025), indicating that offline respondents demonstrated significantly more negative perceptions than online respondents. Age also showed a significant effect, suggesting that older individuals were more likely to hold negative views than younger individuals.
The second analysis addressed perceptions of hiring individuals with mental disorders. The dependent variable measured agreement with the statement, “I cannot hire someone with a mental disorder as an employee,” with higher values indicating more negative perceptions. The regression coefficient of the group variable was 0.025 (p=.032), indicating that offline respondents exhibited significantly more negative attitudes toward hiring than online respondents. Age had a significant positive effect, suggesting that compared with younger individuals, older individuals were more likely to express negative attitudes toward hiring individuals with mental disorders.
The third analysis focused on the perception of danger associated with individuals with mental disorders. Agreement with the statement “People with mental disorders are dangerous” served as the dependent variable, with higher values indicating more negative perceptions. The regression results indicated that the coefficient of the group variable was 0.022 (p=.041), suggesting that offline respondents had significantly more negative perceptions (perceived greater danger) than online respondents. Age had a significant positive effect, indicating that compared with younger individuals, older individuals were more likely to perceive individuals with mental disorders as dangerous. Other control variables, such as sex and occupation, did not consistently exhibit statistically significant effects.
The final analysis examined beliefs regarding the persistence of mental disorders over time. The dependent variable measured agreement with the statement “mental disorders persist for a lifetime,” with higher values indicating stronger agreement (stigma). The regression results showed that the coefficient of the group variable was 0.017 (p=.017), indicating that offline respondents were more likely to agree with the statement than online respondents. Age was a significant factor, suggesting that compared with younger individuals, older individuals were more likely to believe that mental disorders persist throughout their lifetimes. The linear regression findings on public perceptions of mental disorders, examining attitudes toward employment, friendships, perceived danger, and belief in lifelong persistence, are detailed in Table 3.
As a robustness check, we restricted the sample to participants aged 18~69 years (the overlapping range) and re-estimated all logistic and linear regression models. The direction, magnitude, and statistical significance of the group effects remained consistent with the main findings, indicating that the findings were not driven by differences in the age range coverage between the two surveys.
This study compared the mental health reporting patterns of two national surveillance systems in South Korea. The findings revealed substantial differences between these systems, which may reflect a combination of the influence of survey mode effects (anonymity vs interviewer presence), differences in assessment instruments (structured diagnostic criteria vs simplified symptom questions), and their interactions. Although we may not definitively disentangle these contributing factors from the current design, the observed patterns offer important insights into both methodological refinement and policy interpretation.
Before discussing potential explanations, we emphasize that the following interpretations are necessarily speculative. Because this study compares two independently designed survey systems that differ in both survey mode and instruments, there are inherent limitations in clearly disentangling the pure effects of the survey mode from those of the survey instruments. Accordingly, the possibilities and explanatory mechanisms discussed below should not be interpreted as definitive causes of the observed results but rather as hypothetical explanations that may have contributed to the observed patterns.
The principal finding was the substantial divergence in prevalence estimates between the two surveillance systems. Respondents in the web-based survey reported markedly higher rates of internalizing disorders, specifically depression, anxiety, and OCD, than those in the interview-based survey. Several nonmutually exclusive explanations may account for this pattern. First, one possible contributing factor is reduced social pressure in anonymous web-based platforms, sometimes termed “online disinhibition,” which may facilitate a more candid disclosure of stigmatizing symptoms [17]. In face-to-face interviews, respondents may engage in impression management and underreport distress to avoid perceived judgment [12]. Second, the simplified self-reported items in the online survey may have lower diagnostic thresholds than the K-CIDI’s structured criteria, capturing a broader range of subclinical experiences. The online depression item, for instance, asked about “depressive feelings lasting several days,” whereas the K-CIDI required symptoms “almost all day, every day, for more than two weeks.” Instrument effects and mode-specific differences may operate differently across disorder types, potentially producing divergent, rather than uniform, patterns of bias. Thus, the observation of different directions of prevalence differences across conditions does not necessarily imply mode-related differences; disorder-specific measurement properties may also contribute to these patterns.
Conversely, AUD was reported at a higher rate in the interview-based survey than in the web-based survey. This reversal complicates a simple instrument-threshold explanation, arguing against a purely instrument-driven explanation. However, several of these interpretations require further investigation. First, drinking is deeply embedded in South Korean social and business culture, potentially rendering alcohol-related disclosures less stigmatizing than other psychiatric conditions in interpersonal contexts [18]. South Korean drinking culture is characterized by “hoesik,” workplace social gatherings where alcohol consumption serves as a crucial organizational socialization tool [19]. Research indicates that more than 80% of South Korean employees engage in binge drinking at least once within a 12-month period, and the “bottom-up” drinking culture, in which participants are expected to finish drinks in one shot, normalizes heavy alcohol consumption as a means of building workplace solidarity [20]. This cultural context, in which intoxication is often viewed as a sign of workplace camaraderie rather than a problem behavior, may facilitate a more open disclosure of drinking patterns during face-to-face interviews where social rapport is established. The rapport and social scaffolding of face-to-face interviews may facilitate discussions on drinking behaviors that are normatively accepted.
Second, the assessment items differed substantively between surveys. The K-CIDI alcohol module includes specific, behaviorally anchored questions about intoxication frequency (“Have you frequently been intoxicated to the extent that it interfered with work/home?”), which may prompt a concrete recall and disclosure during conversational interviews. In contrast, the online survey’s broader framing (“Have you had alcohol-related problems?”) requires respondents to self-categorize their drinking as “problematic,” a judgment that may be culturally influenced. In a society where heavy drinking and intoxication are normatively accepted and even encouraged as social bonding mechanisms [21], respondents completing the online survey may not perceive their drinking behaviors as constituting “problems,” even when those behaviors would meet diagnostic criteria in a clinical assessment. This interpretation aligns with research indicating that South Korean society maintains a tolerant attitude toward alcohol consumption, leading to many individuals remaining unaware of their drinking problems [22].
Third, disorder-specific factors may have contributed to this reverse pattern. Individuals with AUD often exhibit limited insight into the severity of their drinking problems, which may reduce self-reported prevalence in online surveys where no interviewer prompts self-reflection. Structured interviews with probing questions may facilitate a more accurate recognition of problematic drinking patterns. Conversely, the absence of interviewer guidance in online self-reports may lead to underreporting of alcohol-related symptoms that respondents do not spontaneously identify as problematic. However, we acknowledge that the AUD items also differed substantially between surveys (see Table 4). Thus, the reverse pattern, which suggests mode-specific social dynamics, does not constitute definitive evidence. Alternative explanations, such as unmeasured differences in respondents’ characteristics or differences in item interpretations, cannot be ruled out.
The two surveys yielded similar prevalence estimates for suicidal ideation, with no statistically significant between-survey differences. This null finding is interpretable considering the relatively high comparability of suicidal ideation items across surveys (see Table 4). Both items directly asked about lifetime suicidal thoughts with similar wording, unlike the more divergent items in the other conditions. The convergence of estimates for suicidal ideation suggests that when items are sufficiently similar, the two surveillance systems may yield comparable data, implying that the observed differences in other conditions may partly reflect item-level discrepancies. Alternatively, the stigma surrounding suicide may be so deeply entrenched that neither the anonymity of online surveys nor the rapport of face-to-face interviews substantially affects disclosure [9]. This interpretation aligns with South Korea’s persistently high suicide rates and strong cultural taboos regarding discussing suicidal thoughts. These competing explanations cannot be adjudicated with the current data. Future studies employing identical suicidal ideation measures across randomized survey modes would help clarify whether the observed convergence reflects true mode equivalence or coincidental measurements.
The linear regression analyses revealed that survey context was also associated with the expression of attitudes toward mental illness. Respondents in the interview-based survey expressed more stigmatizing views regarding hiring, befriending, and the perceived dangerousness of individuals with mental disorders than the web-based respondents. This pattern may reflect “public versus private” self-presentation: Individuals may feel compelled to express socially conservative or cautious attitudes when speaking to an interviewer (representing a public, accountable context) but hold or express more progressive views in the privacy of online self-completion. Alternatively, despite the statistical controls, residual differences between the two respondent populations, such as differential exposure to mental health advocacy or digital literacy correlates, may have contributed to these patterns. As with the prevalence findings, we could not isolate a pure-mode effect from other systematic differences between the surveys. However, the consistency of the patterns across multiple attitude measures suggests that the survey context meaningfully shapes expressed stigma, with implications for how public opinion data on mental health are collected and interpreted.
Our findings are broadly consistent with international research demonstrating a higher disclosure of internalizing symptoms in computer-administered and online surveys than in interviewer-administered modes [10]. A meta-analysis by Gnambs and Kaspar [10] found that sensitive behaviors were more frequently reported in self-administered modes, with effect sizes varying with topic sensitivity. However, the reversal observed for AUD in our study appears to be more context-specific and may reflect the unique cultural position of alcohol consumption in South Korean society. Cross-cultural comparative studies examining mode effects across societies with varying attitudes toward alcohol consumption would help determine whether this pattern is culturally specific or generalizable. Additionally, while Western studies have typically found that online anonymity uniformly reduces social desirability bias, our results suggest that the direction of mode effects may be disorder-specific, warranting further investigation into the psychological mechanisms underlying these differential patterns.
The substantial differences between the two national mental health surveillance systems in South Korea have important implications for data interpretation and policy planning. Policymakers who rely on these data sources should recognize that prevalence estimates are not directly comparable across systems. The interview-based K-CIDI survey likely yields more conservative, clinically aligned estimates, whereas the web-based self-report survey may capture a broader spectrum of self-perceived distress. Rather than viewing one system as superior, a pragmatic approach would leverage the complementary strengths of each. Web-based surveys, with fewer barriers to disclosure, may be valuable for detecting hidden distress in populations reluctant to seek formal evaluations. Interview-based surveys with diagnostic rigor provide data that are directly comparable to those of clinical populations and international epidemiological standards. Integrating the findings from both systems, while accounting for their methodological differences, may offer a more complete picture of the mental health of the South Korean population than either system alone.
However, caution should be exercised when translating these findings into specific policy recommendations. A higher self-reported prevalence in online surveys does not necessarily indicate more accurate detection, and without validation against clinical gold standards, we cannot determine whether web-based surveys reduce under-detection or introduce over-inclusion of nonclinical distress. Rigorous validation studies comparing survey-identified cases with structured clinical interviews are needed before recommending large-scale shifts toward online mental health screening.
1. Limitations
The most critical limitation of this study is the nonequivalence of the assessment instruments used in the two surveys. The offline National Mental Health Survey employed the K-CIDI, which is a structured diagnostic interview with specific duration, severity, and impairment criteria aligned with the Diagnostic and Statistical Manual of Mental Disorders (DSM)/International Classification of Diseases diagnostic standards. In contrast, the online survey used brief, symptom-based self-report questions with substantially lower specificity and different diagnostic thresholds (see Table 4). This fundamental methodological difference implies that the observed discrepancies in prevalence estimates cannot be unambiguously attributed to the survey mode (online vs offline anonymity).
The higher reporting of depression, anxiety, and OCD in the online survey may reflect (a) reduced social desirability bias due to the anonymity of web-based completion, (b) lower diagnostic thresholds inherent to the simplified self-report items, or (c) an interaction between both factors. Because these possibilities may not be clearly distinguished based on the current study design and findings, more sophisticated research designs, such as the simultaneous use of diagnostic tools in both online and offline modes for comparison, may be needed in the future.
The reversal observed for AUD provides suggestive evidence that mode-related social dynamics play a role beyond instrument effects; however, because the AUD items also differed between surveys, this interpretation remains tentative rather than conclusive. Additionally, the use of ORs to quantify between-group differences warrants cautious interpretation. When the prevalence exceeds 10%, ORs tend to overestimate the corresponding relative risk, and the large ORs observed for some conditions (e.g., OCD) may exaggerate the true magnitude of the difference between the survey systems.
The two surveys also employed different weighting procedures. The offline survey applied poststratification weights based on census distributions for age, sex, and region, whereas the online survey used quota sampling with panel-based adjustments. These differing weighting approaches may contribute to systematic differences in prevalence estimates independent of the actual reporting patterns. Second, the cross-sectional observational design precludes causal inferences. Respondents were not randomized to survey modes; those who participated in each national survey may have differed in unmeasured ways (e.g., help-seeking attitudes, comfort with technology, and personality traits), which confounded the comparison. Although we statistically controlled for available demographic variables, residual confounding factors likely remained. Third, the two surveys covered slightly different age ranges (offline: 18~79 years; online: 15~69 years) and showed distributional differences in education, occupation, and other characteristics. Although sensitivity analyses restricted to the overlapping age range yielded consistent results, complete population equivalence was not achievable. Fourth, we could not assess the convergent validity of either survey’s prevalence estimates against clinical gold standards (e.g., Structured Clinical Interview for DSM-5). Thus, it remains unclear whether either system yields prevalence estimates that more closely approximate population-level rates. Fifth, both datasets were collected in 2021 during the COVID-19 pandemic, which may have influenced both mental health status and survey response patterns in ways that limit generalizability to nonpandemic periods.
Considering these limitations, definitive conclusions regarding mode-related differences would require randomized experimental designs with identical instruments in different modes. We encourage future research employing randomized experimental designs with identical instruments across survey modes as well as methodological studies comparing weighting approaches to advance the understanding of mental health data collection.
In conclusion, this study documented substantial differences in mental health prevalence estimates and public perceptions between South Korea’s two national mental health surveillance systems. Web-based respondents reported higher rates of depression, anxiety, and OCD, while in-person interview-based respondents reported higher rates of AUD; suicidal ideation estimates were comparable across systems.
These differences likely reflect a combination of survey mode-related differences (anonymity vs interviewer presence), instrument effects (structured diagnostic criteria vs simplified self-reports), and their interaction, which may not be fully disentangled from observational data from independently designed surveys. The findings suggest that national mental health statistics are shaped not only by population health status but also by the methodological characteristics of the surveillance systems that generate them. For policy and practice, these results highlight the importance of methodological awareness when interpreting and comparing mental health data across sources. A multisystem approach that leverages the accessibility of web-based surveys alongside the diagnostic rigor of interview-based assessments may ultimately provide the most comprehensive picture of a population’s mental health. Future research with randomized, instrument-matched designs would help clarify these patterns.

Conflicts of interest

The authors declared no conflict of interest.

Funding

This study was supported by a clinical research grant (No. 2024-10) from the National Center for Mental Health, Republic of Korea.

Table 1.
Participants’ demographic data
Variable Category Online Offline Total
Sex Male (=1) 1,024 2,757 3,781
Female (=2) 992 2,754 3,746
Age group Under 20 104 144 248
20s 340 610 950
30s 356 899 1,255
40s 416 1,111 1,527
50s 444 1,231 1,675
60s or older 356 1,516 1,872
Education Elementary school 0 333 333
Middle school 54 483 537
High school 394 2,076 2,470
College 1,407 2,549 3,956
Graduate school 161 66 227
Occupation Manager 108 106 214
Expert and related workers 144 147 291
Office worker 660 859 1,519
Service worker 150 1,049 1,199
Sales worker 85 860 945
Agriculture, forestry, or fisheries worker 24 160 184
Technicians and related workers 59 347 406
Equipment and machine operators and assembly workers 45 144 189
Simple labor workers 81 275 356
Professional soldiers 3 0 3
Table 2.
Logistic regression results for lifetime prevalence of mental disorders by survey mode
Disorder Coefficient (offline vs online) SE OR 95% CI p-value
AUD 0.286 0.094 1.331 (1.106∼1.600) .002b)
Depression −1.761 0.085 0.172 (0.145∼0.200) <.001c)
Anxiety −1.363 0.089 0.256 (0.215∼0.295) <.001c)
OCD −3.95 0.262 0.019 (0.012∼0.032) <.001c)
Suicidal ideation 0.13 0.093 1.139 (0.949∼1.367) .163

a) p<.05,

b) p<.01,

c) p<.001.

OR represents odds for offline relative to online (reference: online). The models were controlled for age, sex, occupation, education, and income.

SE: standard error, OR: odds ratio, CI: confidence interval, OCD: obsessive-compulsive disorder, AUD: alcohol use disorder.

Table 3.
Linear regression results for public perceptions of mental disorders by survey mode
Perception variable Coefficient (offline vs online) SE Beta 95% CI p-value
Negative attitude toward hiring 0.025 0.012 0.035 (0.002∼0.048) .032a)
Negative attitude toward befriending 0.031 0.014 0.042 (0.004∼0.058) .025a)
Perceived danger 0.022 0.011 0.03 (0.001∼0.043) .041a)
Belief in lifelong persistence 0.017 0.008 0.024 (0.001∼0.033) .017a)

a) p<.05,

b) p<.01,

c) p<.001.

The coefficients represent the group effect (offline vs online) after controlling for age, sex, occupation status, educational level, and income. Higher values indicated more negative attitudes or stronger agreement with stigmatizing beliefs. The models were evaluated using FDR-adjusted p-values for multiple comparisons.

SE: standard error, CI: confidence interval, FDR: false discovery rate.

Table 4.
Comparison of assessment items and diagnostic criteria between offline and online surveys
Disorder Offline survey Online survey Item comparability
Depression "Have you ever felt sad, empty, or depressed almost all day, every day, for more than two weeks?" "Have you experienced depressive feelings lasting several days?" Low
Anxiety "Have you ever experienced excessive or persistent worry?" "Have you experienced anxiety lasting several days?" Moderate
OCD "Have you ever been bothered by unwanted, persistent thoughts?" "Have you had uncontrolled obsessive thoughts or compulsive behaviors?" Moderate
AUD "Have you frequently been intoxicated to the extent that it interfered with work/home?" "Have you had alcohol-related problems?" Low
Suicidal ideation "Have you ever seriously thought about committing suicide?" "Have you ever had suicidal thoughts?" High

Table 4 presents a direct comparison of the assessment items used in each survey. The two surveys employed fundamentally different approaches: The offline survey used structured diagnostic questions derived from the K-CIDI with specific duration and severity criteria, whereas the online survey used simplified symptom-based questions. This instrumental nonequivalence is a critical consideration when interpreting between-survey differences and represents an inherent limitation in comparing independently designed national surveillance systems. Importantly, however, the direction of the between-survey differences was not uniform across all conditions (e.g., the reversal observed for AUD), indicating that item differences alone cannot fully explain the observed patterns. Thus, the table clarifies the item-level differences and does not imply the construction of equivalence across systems.

K-CIDI: Korean Composite International Diagnostic Interview, OCD: obsessive-compulsive disorder, AUD: alcohol use disorder.

Figure & Data

References

    Citations

    Citations to this article as recorded by  

      • PubReader PubReader
      • ePub LinkePub Link
      • Cite
        CITE
        export Copy
        Close
        Download Citation
        Download a citation file in RIS format that can be imported by all major citation management software, including EndNote, ProCite, RefWorks, and Reference Manager.

        Format:
        • RIS — For EndNote, ProCite, RefWorks, and most other reference management software
        • BibTeX — For JabRef, BibDesk, and other BibTeX-specific software
        Include:
        • Citation for the content below
        Incompatible National Mental Health Surveillance Systems in South Korea: Dramatically Different Prevalence Estimates between Structured Interviews and Brief Online Self-Reports
        STRESS. 2026;34(1):14-24.   Published online March 30, 2026
        Close
      • XML DownloadXML Download
      Incompatible National Mental Health Surveillance Systems in South Korea: Dramatically Different Prevalence Estimates between Structured Interviews and Brief Online Self-Reports
      Incompatible National Mental Health Surveillance Systems in South Korea: Dramatically Different Prevalence Estimates between Structured Interviews and Brief Online Self-Reports
      Variable Category Online Offline Total
      Sex Male (=1) 1,024 2,757 3,781
      Female (=2) 992 2,754 3,746
      Age group Under 20 104 144 248
      20s 340 610 950
      30s 356 899 1,255
      40s 416 1,111 1,527
      50s 444 1,231 1,675
      60s or older 356 1,516 1,872
      Education Elementary school 0 333 333
      Middle school 54 483 537
      High school 394 2,076 2,470
      College 1,407 2,549 3,956
      Graduate school 161 66 227
      Occupation Manager 108 106 214
      Expert and related workers 144 147 291
      Office worker 660 859 1,519
      Service worker 150 1,049 1,199
      Sales worker 85 860 945
      Agriculture, forestry, or fisheries worker 24 160 184
      Technicians and related workers 59 347 406
      Equipment and machine operators and assembly workers 45 144 189
      Simple labor workers 81 275 356
      Professional soldiers 3 0 3
      Disorder Coefficient (offline vs online) SE OR 95% CI p-value
      AUD 0.286 0.094 1.331 (1.106∼1.600) .002b)
      Depression −1.761 0.085 0.172 (0.145∼0.200) <.001c)
      Anxiety −1.363 0.089 0.256 (0.215∼0.295) <.001c)
      OCD −3.95 0.262 0.019 (0.012∼0.032) <.001c)
      Suicidal ideation 0.13 0.093 1.139 (0.949∼1.367) .163
      Perception variable Coefficient (offline vs online) SE Beta 95% CI p-value
      Negative attitude toward hiring 0.025 0.012 0.035 (0.002∼0.048) .032a)
      Negative attitude toward befriending 0.031 0.014 0.042 (0.004∼0.058) .025a)
      Perceived danger 0.022 0.011 0.03 (0.001∼0.043) .041a)
      Belief in lifelong persistence 0.017 0.008 0.024 (0.001∼0.033) .017a)
      Disorder Offline survey Online survey Item comparability
      Depression "Have you ever felt sad, empty, or depressed almost all day, every day, for more than two weeks?" "Have you experienced depressive feelings lasting several days?" Low
      Anxiety "Have you ever experienced excessive or persistent worry?" "Have you experienced anxiety lasting several days?" Moderate
      OCD "Have you ever been bothered by unwanted, persistent thoughts?" "Have you had uncontrolled obsessive thoughts or compulsive behaviors?" Moderate
      AUD "Have you frequently been intoxicated to the extent that it interfered with work/home?" "Have you had alcohol-related problems?" Low
      Suicidal ideation "Have you ever seriously thought about committing suicide?" "Have you ever had suicidal thoughts?" High
      Table 1. Participants’ demographic data

      Table 2. Logistic regression results for lifetime prevalence of mental disorders by survey mode

      p<.05,

      p<.01,

      p<.001.

      OR represents odds for offline relative to online (reference: online). The models were controlled for age, sex, occupation, education, and income.

      SE: standard error, OR: odds ratio, CI: confidence interval, OCD: obsessive-compulsive disorder, AUD: alcohol use disorder.

      Table 3. Linear regression results for public perceptions of mental disorders by survey mode

      p<.05,

      p<.01,

      p<.001.

      The coefficients represent the group effect (offline vs online) after controlling for age, sex, occupation status, educational level, and income. Higher values indicated more negative attitudes or stronger agreement with stigmatizing beliefs. The models were evaluated using FDR-adjusted p-values for multiple comparisons.

      SE: standard error, CI: confidence interval, FDR: false discovery rate.

      Table 4. Comparison of assessment items and diagnostic criteria between offline and online surveys

      Table 4 presents a direct comparison of the assessment items used in each survey. The two surveys employed fundamentally different approaches: The offline survey used structured diagnostic questions derived from the K-CIDI with specific duration and severity criteria, whereas the online survey used simplified symptom-based questions. This instrumental nonequivalence is a critical consideration when interpreting between-survey differences and represents an inherent limitation in comparing independently designed national surveillance systems. Importantly, however, the direction of the between-survey differences was not uniform across all conditions (e.g., the reversal observed for AUD), indicating that item differences alone cannot fully explain the observed patterns. Thus, the table clarifies the item-level differences and does not imply the construction of equivalence across systems.

      K-CIDI: Korean Composite International Diagnostic Interview, OCD: obsessive-compulsive disorder, AUD: alcohol use disorder.


      STRESS : STRESS
      TOP