ExplorerComputer ScienceCybersecurity
Research PaperResearchia:202606.24014

HelpBench: Assessing the Ability of LLMs to Provide Privacy, Safety, and Security Advice

Sarah Meiklejohn

Abstract

This paper introduces HelpBench, a benchmark for assessing whether LLMs are capable of providing accurate help in response to questions about digital privacy, safety, and security. We curated 450 questions representing authentic user situations and developed rubrics for each question to evaluate the factual accuracy and tone of a response. Example questions touch on how to regain access to lost or suspended accounts, how to balance the trade-offs of hardware security keys versus other forms of t...

Submitted: June 24, 2026Subjects: Cybersecurity; Computer Science

Description / Details

This paper introduces HelpBench, a benchmark for assessing whether LLMs are capable of providing accurate help in response to questions about digital privacy, safety, and security. We curated 450 questions representing authentic user situations and developed rubrics for each question to evaluate the factual accuracy and tone of a response. Example questions touch on how to regain access to lost or suspended accounts, how to balance the trade-offs of hardware security keys versus other forms of two-factor authentication, whether a suspicious email is likely a scam, or whether an abuser might be able to track an individual based on their device peripherals. We then developed and applied an auto-rater to evaluate responses from 18 state-of-the-art LLMs. Our results indicate that while models provide high-quality advice (with scores of 82% on average), one in ten responses from models scores less than 65%, reflecting inaccurate and even harmful advice. Addressing these failures is critical for models to serve as trustworthy sources of assistance for digital privacy, safety, and security needs.


Source: arXiv:2606.24819v1 - http://arxiv.org/abs/2606.24819v1 PDF: https://arxiv.org/pdf/2606.24819v1 Original Link: http://arxiv.org/abs/2606.24819v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Jun 24, 2026
Topic:
Computer Science
Area:
Cybersecurity
Comments:
0
Bookmark