All published articles of this journal are available on ScienceDirect.
Association, Correlation, and Causation: Three Important and Misleading Terms in Research
Abstract
This study investigates three key ideas in statistics: association, correlation, and causation. When two variables change together, it does not necessarily mean that one variable causes the other to change. An association indicates that certain variables tend to appear or vary together, without confirming a direct link. Correlation goes a step further by measuring how strongly and in what direction two variables are linearly related, yet it still does not establish a cause-and-effect connection. Causation is the strongest form of relationship, yet proving it demands well-planned studies that account for potential confounding factors. Understanding the distinctions between these concepts is crucial for accurate data interpretation, sound study design, and the prevention of misleading conclusions.
Dear editor,
Association, correlation, and causation are fundamental ideas in statistics, but they are often misinterpreted or conflated. Association refers to a relationship between two variables in which a change in one tends to coincide with a change in the other. However, this association does not necessarily mean that one variable directly causes a change in the other [1]. In contrast, correlation is a more precise way of assessing the relationship between two variables. This relationship is usually expressed using a correlation coefficient between -1 and 1, which indicates the strength of the relationship in a linear fashion [2, 3]. It simply shows that two variables are connected or tend to vary together, without demonstrating that one directly produces changes in the other [3]. The third, and most complex, of the three concepts is causality. This concept indicates that a change in one variable directly causes a change in the other. Demonstrating this type of relationship usually requires carefully designed experiments to ensure that the observed effect is actually caused by the variable under investigation [4].
Researchers use validated methodological approaches to determine whether a causal claim is justified. In fields such as epidemiology, the Austin-Bradford-Hill criteria provide a structured framework for assessing whether an observed relationship is likely to be causal. These criteria consider aspects such as the timing of events, consistency of findings across studies, biological plausibility, and the existence of dose-response relationships. Modern approaches to causal inference also use counterfactual models, which assess causality by comparing actual outcomes with hypothetical outcomes under different conditions. Directed acyclic graphs (DAGs) are often used to map hypothetical causal pathways and identify potential confounding variables. Using these tools allows researchers to draw more robust conclusions and avoid mistaking simple associations or correlations for definitive evidence of cause and effect.
Research methodology highlights the importance of distinguishing between association, correlation, and causality. If these concepts are not carefully separated, study findings can easily be misinterpreted [5]. A careful analysis of rejected studies in biomedical research shows that a large number of these rejections are due to incorrect claims about causality, misuse of analytical techniques, or misunderstanding of statistical correlations [6]. These observations highlight the importance of distinguishing true causal relationships from descriptive associations, as well as the broader consequences that can arise when research concepts are misinterpreted.
This study highlights the importance of distinguishing between association, correlation, and causation. For example, careful reviews of randomized controlled trials conducted during the COVID-19 pandemic have shown that authors sometimes overstate the causal significance of their results, even when the trial data do not fully support such claims [7]. This pattern highlights the ongoing risk of confusing correlations with true causal effects, particularly in high-pressure research environments. Similarly, long-term assessments of methodological quality in fields such as preclinical urology have revealed significant inconsistencies, including inconsistent adherence to the standards required to make reliable causal claims [8].
Conducting sound research requires a clear understanding of the differences between association, correlation, and causation, as confusing them can lead to misleading scientific conclusions. For example, some influential observational studies on nutrition and chronic disease have mistakenly inferred causal relationships from observed associations. By carefully applying established causal frameworks, researchers can more accurately report their findings [9, 10].
In areas such as healthcare, where decisions based on statistical data can have real-world impacts on people’s lives, confusing association, correlation, and causation can lead to incorrect conclusions [11]. Therefore, it is crucial to use sound judgment and critical thinking when evaluating statistical relationships in research. Researchers should be transparent about their methods and results. Ultimately, a proper understanding of these concepts enables better decision-making and contributes to a more informed society.
AUTHORS’ CONTRIBUTIONS
The authors confirm contribution to the paper as follows: F.M.: Contributed to conceptualization, review, project management, writing the original draft, reviewing, and editing; S.B.: Contributed to writing the original draft, reviewing, and editing.
ACKNOWLEDGEMENTS
Declared none.

