世界科技研究与发展 ›› 2026, Vol. 48 ›› Issue (1): 69-79.doi: 10.16507/j.issn.1006-6055.2025.12.005 cstr: 32308.14.1006-6055.2025.12.005

• 科技前沿与态势 • 上一篇    

大语言模型隐私保护应用与自身风险防御研究

卫霞1 张文俊2   

  1. 1.西安明德理工学院;2.国家计算机网络应急技术处理协调中心陕西分中心
  • 发布日期:2026-02-28

Research on Privacy Protection Applications of Large Language Models and Defense Against Their Own Risks

WEI Xia1 ZHANG Wenjun2   

  1. 1.Xi'an Mingde Institute of Technology; 2.Shaanxi Branch of National Computer Network Emergency Response Technical Team/Coordination Center
  • Published:2026-02-28

摘要: 随着大语言模型(LLMs)在各个领域的广泛应用,其在隐私治理及自身风险方面的问题日益凸显。本文系统探讨了LLMs在隐私保护中的双面性:一方面,LLMs作为智能工具可增强数据安全防护能力,如提升代码漏洞检测任务的准确率;另一方面,面临梯度泄露、成员推理及个人身份信息泄漏等典型隐私攻击,LLMs具有较大的隐私风险。基于我国网络空间治理的法律框架,按照预防义务、处理规则、权利保障、事件响应的结构,本文梳理LLMs在隐私保护方面的合规性应用情况;同时分析针对LLMs隐私的典型攻击及防御手段,并从“数据—训练—推理”全生命周期角度探讨了提升LLMs安全性的举措,指出未来需着重解决模型规模扩张与隐私保护需求之间的根本矛盾。

关键词: 大语言模型;防御机制;隐私保护;数据清洗;监督微调

Abstract: With the widespread application of large language models (LLMs) across various fields, issues related to privacy governance and their own risks have become increasingly prominent. This paper systematically explores the dual nature of LLMs in privacy protection: on one hand, LLMs, as intelligent tools, can enhance data security capabilities, such as improving the accuracy of code vulnerability detection tasks; on the other hand, they face typical privacy attacks, including gradient leakage, membership inference, and personal identity information disclosure, posing significant privacy risks. Based on China’s legal framework for cyberspace governance, this paper reviews the compliance applications of LLMs in privacy protection according to the structure of preventive obligations, processing rules, rights protection, and incident response. It also analyses typical privacy attacks and defense methods targeting LLMs, and discusses measures to enhance LLM security from a full lifecycle perspective of “data-training-inference”, highlighting the fundamental conflict between model scale expansion and privacy protection needs that must be addressed in the future.

Key words: Large Language Models; Defense Mechanism; Privacy Protection; Data Cleaning; Supervision Fine tuning