The PHQ-9 (Patient Health Questionnaire-9) is the most widely used depression screening tool in US primary care and rehabilitation settings. Based directly on DSM-5 criteria, it gives clinicians a validated, billable measure for identifying depressive disorders, setting treatment baselines, and tracking patient progress over time.This reference covers everything you need to administer, score, and document PHQ-9 results, including US billing codes, USPSTF compliance, and integration into PT and behavioral health workflows.
How to score the PHQ-9
Each of the nine questions is rated on a 0–3 frequency scale ("not at all" to "nearly every day"), producing a total score between 0 and 27. Scoring takes under two minutes and can be completed by the patient before the clinical encounter — making it practical for both in-person and telehealth settings.Clinical threshold to note: A score of 10 or above has 88% sensitivity and 88% specificity for major depressive disorder (Kroenke et al., J Gen Intern Med, 2001). Any affirmative response to Question 9 (suicidal ideation) requires immediate risk assessment, regardless of total score
Key Takeaways
PHQ-9 score ranges and clinical action
The following ranges reflect the standard clinical thresholds validated in primary care populations. Use these in conjunction with the patient's reported functional impairment (Question 10) and clinical judgment — a score alone does not constitute a diagnosis.
0–4: Minimal - monitor; reassess at next visit if clinically indicated.
5–9: Mild - atchful waiting; consider behavioral activation strategies.
10–14: Moderate - clinically significant; structured follow-up or referral warranted.
15–19: Moderately severe — active treatment indicated (psychotherapy, medication, or both).
20–27: Severe - uirgent evaluation; consider immediate referral to psychiatry.
A 5-point change between administrations is the validated minimum for a clinically meaningful shift in symptom severity.
What is the PHQ-9 and why is it used?
The Patient Health Questionnaire-9 (PHQ-9) is a widely used screening tool for identifying and monitoring depression. Clinicians use it for PHQ-9 scoring interpretation depression levels, helping determine symptom severity and guide treatment decisions in clinical settings.
What does a PHQ-9 score of 14 mean?
A PHQ-9 score 14 severity category moderate indicates moderate depression. Patients in this range may require clinical evaluation, counseling, or behavioral therapy depending on symptoms and functional impairment.
What does a PHQ-9 score of 17 indicate?
A PHQ-9 score 17 severity category typically falls under moderately severe depression. This level often requires active treatment, which may include psychotherapy, medication, or referral to a mental health specialist.
How does the PHQ-9 measure depression symptoms?
The PHQ-9 includes nine DSM-based questions that assess common depression symptoms over the past two weeks. Total scores are used for PHQ-9 scoring interpretation depression levels and help clinicians categorize patients within the standard PHQ-9 score ranges for depression severity.
How is each item scored from 0 to 3?
Total PHQ-9 scores range from 0 to 27. Question nine about suicidal thoughts needs special attention; it counts toward diagnosis if present at all, whatever the duration.
PHQ-9 in US clinical practice — billing, compliance & documentation
USPSTF recommendation: The US Preventive Services Task Force (USPSTF) gives depression screening in adults a Grade B recommendation — meaning it should be offered to all adults in primary care settings. The PHQ-9 is the most commonly used instrument for fulfilling this requirement.
Billing codes for depression screening:
— G0444: Annual depression screening, 15 minutes (Medicare/Medicaid, primary care)
— 96127: Brief emotional/behavioral assessment — used by PTs and OTs to bill for standardized screening tools including the PHQ-9
— G2211: Complexity add-on — applicable when depression screening is part of a complex care visit
Documentation tip for SOAP notes: Document the PHQ-9 total score, the date administered, who administered it, and the clinical action taken. In PT documentation, note how depression severity informs the patient's rehabilitation prognosis and goals.
PHQ-9 in physical therapy — why it matters for rehab outcomes
Depression is a significant predictor of rehabilitation outcomes. Studies show that patients with PHQ-9 scores ≥10 have longer recovery timelines and lower functional gains in musculoskeletal rehab. Screening for depression using the PHQ-9 at intake — and reassessing at 4–6 week intervals — allows PTs to adjust treatment plans, set realistic patient goals, and make timely referrals to behavioral health providers.
The PHQ-9 is particularly relevant in chronic pain populations, post-surgical recovery, and patients with neurological conditions where depression comorbidity is common. Spry's PROMs platform allows clinicians to send the PHQ-9 digitally, track scores across visits, and link results directly to SOAP documentation.
PHQ-9 vs GAD-7 — choosing the right tool for comorbid anxiety
Because depression and anxiety frequently co-occur — particularly in chronic pain and post-surgical patients — many US clinicians administer both the PHQ-9 and the GAD-7 (Generalized Anxiety Disorder 7-item scale) at intake. Together, they take under five minutes and provide a complete picture of common mental health comorbidities.
Use the PHQ-9 when the primary concern is depressive symptoms, treatment response monitoring, or USPSTF compliance. Use the GAD-7 when anxiety, worry, or somatic symptoms are the presenting concern. For patients with both, administer both — most EHRs and PROMs platforms support dual administration in a single patient session.
Integrating PHQ-9 into measurement-based care (MBC)
Measurement-Based Care (MBC) — the systematic use of validated outcome measures to guide clinical decisions — is now expected by many US payers, ACOs, and value-based care contracts. The PHQ-9 is one of the most widely mandated tools in MBC frameworks.
Best practice: administer at intake, at 4 weeks, and at 8 weeks (or at every 3rd visit). A ≥5-point improvement by week 8 is associated with sustained remission. If no improvement is seen by week 4, consider escalating care intensity or co-treatment referral. Spry automatically tracks PHQ-9 trends across visits and flags patients who are not improving on schedule.
Understanding PHQ-9 scoring and interpretation
A proper assessment and treatment plan needs both numerical scores and clinical significance when interpreting PHQ-9 results.
Mild, moderate, and severe depression categories
Patients who score in the mild range (5-9) usually have no clinical depression or just below-threshold symptoms. These cases might only need monitoring and follow-up checks.
A treatment plan becomes necessary for scores in the moderate range (10-14). This could include counseling, regular check-ins, or medication.
Patients scoring in the moderately severe (15-19) and severe (20-27) ranges need active treatment. This often includes antidepressants, psychotherapy, or both approaches combined.
What is considered a 'positive' screen for depression?
A PHQ-9 score of 10 or higher points to a positive depression screen, with 88% sensitivity and specificity for major depression. This cutoff helps identify real cases while keeping false positives low.
A major depressive disorder diagnosis needs more than just the score. A formal diagnosis requires five or more symptoms marked as "2" or "3," and at least one must be depressed mood or anhedonia (questions 1 and 2).
How to interpret scores in the 10–14 'gray zone'?
The 10-14 score range includes patients who might or might not have major depression. This range has a positive likelihood ratio of 2.6 for major depression. Doctors should look at how long symptoms last and how they affect daily life.
The additional functional impairment question is vital. An answer of "somewhat difficult" or worse points to depression that needs treatment. That's why scores in this gray zone need a full clinical assessment rather than just looking at numbers.
Reliability, validity, and clinical use of PHQ-9
Research shows the PHQ-9 has excellent measurement properties in a variety of populations. The tool's internal consistency remains high, with Cronbach's α ranging from 0.86-0.89 in primary care studies. Its test-retest reliability stands impressively at 0.82 (95% CI: 0.74-0.90), which proves its stability over time.
How reliable is the PHQ-9 across different populations?
The PHQ-9's strong psychometric qualities remain consistent in demographic groups of all types. Medical professionals have translated it into more than 70 languages, and the questionnaire works reliably across cultural contexts. Some studies reveal differences in how items function between language versions, especially for physical symptoms like sleep and appetite. Neurological populations benefit particularly from the questionnaire's strength, with sensitivity above 79% and specificity higher than 78% at the standard cut-point.
PHQ-9 vs other depression screening tools
The PHQ-9 shows better diagnostic odds ratios and area under the curve statistics than alternatives. The PHQ-2 offers a shorter option with similar sensitivity (91.8%) but lower specificity (67.7%) compared to the full PHQ-9. Postpartum depression screening works better with the Edinburgh Postnatal Depression Scale than the PHQ-9. Quick screening becomes possible through a sequential approach - PHQ-2 followed by PHQ-9 - which cuts assessment time by about 57% while keeping diagnostic accuracy.
Use in primary care and mental health settings
Primary care physicians value the PHQ-9's efficiency since it takes less than three minutes to review. The tool's objective measurement capabilities make it even more valuable. Clinicians find it especially useful to track treatment response over time. Many healthcare providers use the instrument creatively beyond depression screening, including ways to address treatment resistance.
Limitations and considerations when using PHQ-9
The PHQ-9's limitations deserve attention despite its strengths. Clinical judgment still matters more than the tool alone. Psychiatric specialty settings see lower specificity and positive predictive values compared to primary care. Conditions like schizophrenia, panic disorder, and bipolar disorder often lead to false positives. A newer study published in suggests the PHQ-9 might overdiagnose depression - 24.6% of participants met PHQ-9 criteria while only 12.1% showed depression in structured interviews.
Conclusion
The PHQ-9 is a quick and reliable tool for screening and monitoring depression across many healthcare settings. Its standardized PHQ-9 scoring interpretation depression levels help clinicians categorize symptoms using clear PHQ-9 score ranges for depression severity.
Widely adopted by health systems like the Veterans Administration and the UK’s NHS, the PHQ-9 supports both initial screening and ongoing treatment monitoring. For example, a PHQ-9 score 14 severity category moderate may require closer clinical evaluation, while a PHQ-9 score 17 severity category typically indicates moderately severe depression that may need active treatment.
While highly effective, the PHQ-9 works best when combined with clinical judgment and a comprehensive patient assessment to guide appropriate care.
FAQs
Q1. What does a PHQ-9 score indicate?
PHQ-9 scores range from 0 to 27, with different ranges indicating varying levels of depression severity. Scores below 5 typically suggest no depression, 5-9 indicate mild symptoms, 10-14 represent moderate depression, 15-19 signify moderately severe depression, and scores of 20 or above indicate severe depression.
Q2. How is the PHQ-9 used in healthcare settings?
The PHQ-9 is a versatile tool used in various healthcare settings, primarily for screening and monitoring depression. It's commonly used in primary care offices, mental health clinics, and obstetrics-gynecology practices. The questionnaire helps healthcare providers quickly assess depression symptoms and track treatment progress over time.
Q3. What is considered a positive result on the PHQ-9?
A score of 10 or higher on the PHQ-9 is generally considered a positive screen for depression. This threshold balances sensitivity and specificity, both at 88% for major depression. However, a formal diagnosis requires additional clinical assessment beyond just the numerical score.
Q4. How long does it take to complete the PHQ-9?
The PHQ-9 is a brief questionnaire that typically takes less than three minutes to complete. This efficiency makes it particularly valuable in busy clinical settings where time is often limited.
Q5. Can the PHQ-9 diagnose depression on its own?
While the PHQ-9 is a highly reliable screening tool, it cannot diagnose depression on its own. It's designed to be used in conjunction with clinical judgment and further assessment. Healthcare providers should consider the PHQ-9 score alongside other factors, including the patient's history and functional impairment, to make a comprehensive evaluation.
Reduce costs and improve your reimbursement rate with a modern, all-in-one clinic management software.
Get a Demo

