Securing LLMs: Best Practices for Enterprise Deployment

Author: Vaibhav Malik, ISC2 CC, Global Partner Solution Architect, Cloudflare
Date Published: 1 November 2024
Read Time: 16 minutes
Related: Artificial Intelligence: A Primer on Machine Learning, Deep Learning, and Neural Networks | Digital | English

Large language models (LLMs) have emerged as a transformative force in the rapidly evolving information technology landscape, offering unprecedented capabilities in natural language processing, content generation, and decision support. Integrating LLMs into enterprise operations is not merely a technological upgrade; it represents a fundamental shift in how organizations process information, interact with customers, and make decisions.

However, as with any emerging technology, adopting LLMs introduces new vulnerabilities and risk factors that must be carefully managed. From data privacy concerns to the potential for malicious manipulation, the security implications of LLM deployment are far-reaching and complex. Organizations must develop comprehensive strategies for secure LLM deployment in enterprise settings.

Understanding LLM Vulnerabilities

To effectively secure LLMs, it is crucial to first understand their underlying architecture and the unique vulnerabilities this architecture creates. LLMs are complex neural networks trained on vast datasets, using sophisticated algorithms to predict and generate human-like text. This capability, while powerful, also introduces several vital vulnerabilities.

Prompt Injection Attacks
Prompt injection attacks exploit the LLM’s core functionality by crafting inputs that manipulate the model into performing unintended actions. For example, an attacker might inject instructions into a prompt that causes the LLM to ignore previous safety constraints or reveal sensitive information. The OpenAI research team demonstrated this vulnerability in their paper “Constitutional AI: Harmlessness from AI Feedback,” highlighting the need for robust input validation mechanisms.¹ They showed how models could be manipulated to produce harmful content despite being trained to be safe and helpful. This research underscores the need for robust input validation mechanisms and highlights the challenges in creating artificial intelligence (AI) systems that reliably adhere to specified rules and constraints.

Data Poisoning
During the training phase, if malicious data is introduced into the training set, it can lead to a compromised model. This “poisoned” model might produce biased outputs, leak sensitive information, or behave erratically in specific scenarios. A study by Wan et al. demonstrated how even a small amount of poisoned data can significantly impact model behavior.² They showed that by poisoning just 0.02% of the instruction tuning data, they could consistently manipulate a model to generate harmful content about a target entity.

Model Inversion and Extraction
These attacks aim to reverse-engineer the model or extract the data on which it was trained. In model inversion, an attacker tries to recreate training data by observing the model’s outputs. Model extraction involves creating a knockoff model by repeatedly querying the original model and using its outputs for training. Research by Carlini et al. in “Extracting Training Data from Large Language Models” showed the feasibility of extracting personal information from LLMs, underscoring the importance of robust data protection measures.³ The research showed that it was possible to extract verbatim sequences from the training data of GPT-2, a predecessor to more advanced models such as GPT-3 and GPT-4.

From data privacy concerns to the potential for malicious manipulation, the security implications of LLM deployment are far-reaching and complex. Organizations must develop comprehensive strategies for secure LLM deployment in enterprise settings.

Adversarial Examples
These are inputs specifically designed to cause the model to make mistakes. In the context of LLMs, this could involve subtle modifications to prompts that result in harmful or incorrect outputs. The work of Wang et al. in “Adversarial Demonstration Attacks on Large Language Models” demonstrates a novel attack method that exploits LLMs’ in-context learning capabilities, highlighting the potential vulnerabilities in these systems.⁴

The various attack vectors discussed, from prompt injection to adversarial attacks, underscore the complex security landscape surrounding LLMs. As these models become more prevalent in enterprise settings, understanding and mitigating their risk becomes crucial.

Enterprise Risk of Unsecured LLMs

The vulnerabilities inherent in LLMs present significant risk for enterprises. There are several risk types that organizations should be aware of, each with real-world implications.

Data Privacy Breaches
LLMs often handle sensitive information, from customer data to internal enterprise communications. If compromised, an LLM could inadvertently reveal this confidential information, leading to privacy violations and potential legal consequences. For example, in 2023, Samsung banned its employees from using ChatGPT and other generative AI tools after discovering that employees had uploaded sensitive internal source code to ChatGPT, potentially exposing proprietary information.⁵

The European Union’s General Data Protection Regulation (GDPR) and similar laws worldwide have made such breaches particularly costly.⁶

Intellectual Property Theft
Enterprises using LLMs for tasks involving proprietary information risk exposing their intellectual property. For example, if a model inversion attack were to succeed, it could potentially enable competitors to uncover sensitive information regarding proprietary business strategies or upcoming product developments.

Reputational Damage
An LLM generating biased, offensive, or incorrect content due to security flaws could severely damage an enterprise’s reputation. This is particularly risky for customer-facing applications of LLMs, where a single inappropriate response could gain a lot of attention and cause significant backlash. For example, in 2016, Microsoft’s AI chatbot Tay was taken offline within 24 hours of its launch after it began posting offensive tweets.⁷

Compliance Violations
In heavily regulated industries, an unsecured LLM could lead to breaches of compliance standards. For example, in healthcare, an LLM accidentally revealing patient information would violate Health Insurance Portability and Accountability Act (HIPAA) regulations in the United States.⁸

Financial Losses
The culmination of risk—from legal penalties for data breaches to lost business due to reputational damage—can result in significant financial losses for the enterprise. The IBM Cost of a Data Breach Report 2024 found that the average cost of a data breach reached an all-time high of US$4.88 million.⁹

Best Practices for Secure LLM Deployment

To mitigate LLM risk, enterprises should implement the following best practices:

Robust Model Training and Selection

Vet training data meticulously to prevent data poisoning attacks.
Choose appropriate model sizes and capabilities that align with specific use cases, avoiding unnecessarily powerful models that could introduce additional risk.
Implement adversarial training techniques to improve model resilience against malicious inputs.¹⁰

Secure Infrastructure

Deploy LLMs within isolated environments, separating them from other critical systems.
Implement stringent access controls and multifactor authentication (MFA) measures to prevent unauthorized access.
Ensure that all interactions with the LLM occur over encrypted communication channels.

Input Sanitization and Validation

Implement strong input filtering mechanisms that detect and neutralize potential malicious prompts.
Utilize a separate pre-processing layer that sanitizes inputs before they reach the LLM.
Employ AI-powered anomaly detection systems to identify unusual or potentially harmful input patterns.¹¹

Output Filtering and Monitoring

Implement content filters that screen LLM outputs for potentially sensitive or inappropriate content.
Establish continuous monitoring systems that analyze LLM outputs for anomalies or security breaches.
Utilize privacy-preserving techniques such as differential privacy¹² to add noise to outputs, reducing the risk of information leakage.

Regular Security Audits and Penetration Testing

Develop LLM-specific security testing protocols that address unique vulnerabilities. These protocols might include:
- Prompt injection testing—Systematically test the LLM with various crafted prompts designed to bypass safety measures or elicit unauthorized responses. This could involve creating a database of potentially malicious prompts and regularly updating it based on newly discovered vulnerabilities
- Data extraction attempts—Implement tests that extract training data or sensitive information from the model. This could involve techniques such as model inversion attacks or membership inference attacks.
- Adversarial example testing—Use adversarial examples specific to text-based models to test the LLM’s robustness. This might involve subtle word substitutions or syntactic changes that aim to alter the model’s output dramatically.
- Fine-tuning vulnerability assessment—If the LLM allows for fine-tuning, test the security of this process by attempting to introduce backdoors or biases through malicious data.
- Output consistency checking—Develop automated tools to check for inconsistencies in the LLM’s outputs across multiple runs or slight variations in input. Significant inconsistencies could indicate vulnerabilities.
Conduct regular vulnerability assessments and penetration testing tailored to LLM systems.
Engage third-party security experts to perform independent audits of LLM deployments.

Employee Training and Awareness

Staff should be educated on LLM security risk and best practices for interaction through a multistep approach:

Mandatory onboarding training—All employees who interact with LLMs should undergo a comprehensive onboarding session that covers basic LLM security principles, potential risk, and enterprise policies.
Regular professional development workshops—Organize quarterly or biannual workshops led by AI security experts. These could be in-person or virtual sessions that address specific LLM security topics and emerging threats.
Online learning modules—Develop a series of self-paced, interactive elearning modules that employees can access at any time. These modules could cover various aspects of LLM security and end with assessments to ensure comprehension.
Simulated phishing-style exercises—Conduct exercises that simulate LLM-specific security threats (e.g., prompt injection attempts) to test employee vigilance and provide hands-on learning experiences.
Lunch-and-learn sessions—Host informal, optional sessions where employees can discuss recent developments in LLM technology and security to foster a culture of continuous learning.
Certificationprograms—Offer internal or external certification programs for employees who want to become LLM security champions within their teams.
Annual mandatory refresher courses—Require all employees to complete an annual refresher course that covers any updates to LLM security best practices and enterprise policies.

Ethical Considerations and Governance

An AI ethics committee can be created to oversee LLM deployments and usage. This committee should be established while keeping in mind:

Composition
- AI/ML experts who understand the technical aspects of LLMs
- Legal professionals versed in data protection and AI regulations
- Ethicists or philosophers to address moral implications
- Domain experts from relevant business units
- Human resources (HR) representatives to consider employee impact
- Customer advocates to represent end-user interests
- Diversity and inclusion specialists to address bias concerns
Requirements for Membership
- Demonstrated expertise in the field (e.g., advanced degree, years of experience)
- Understanding of AI technologies and their societal impacts
- Commitment to ethical decision making and impartiality
- Ability to balance technical, business, and ethical considerations
- Ongoing education in AI ethics and related fields
Responsibilities
- Review and approve LLM deployments before implementation
- Conduct regular audits of existing LLM applications
- Develop and update ethical guidelines for LLM use
- Investigate and respond to ethical concerns or incidents
- Provide guidance on complex ethical dilemmas related to LLM use
- Engage with external stakeholders on AI ethics issues
- Establish clear accountability measures for LLM-related decisions and outputs¹³
Authority
- Power to halt or modify LLM deployments that do not meet ethical standards
- Direct reporting line to C-suite executives to ensure that ethical considerations are prioritized
Meeting Frequency
- Regular scheduled meetings (e.g., monthly) to review ongoing projects
- Ad-hoc meetings as needed to address urgent ethical issues

This expanded structure for the AI ethics committee ensures that ethical considerations are thoroughly integrated into the organization’s LLM deployment and usage strategies. By including a diverse group of experts and giving them absolute authority, organizations can better navigate the complex ethical landscape of AI and LLMs.

Case Study: Journey to Safe LLM Deployment

To illustrate the practical application of these best practices, consider the case of a fictional mid-sized financial services company that decided to implement an LLM-powered customer service chatbot.

Initial Deployment and Challenges
The company initially rushed to deploy its LLM chatbot, eager to improve customer service efficiency. However, it quickly encountered several security issues:

The chatbot occasionally revealed sensitive customer information in its responses.
There were instances of the model generating biased or inappropriate content.
The security team detected attempts at prompt injection attacks from unknown sources.
Employees were using the chatbot for tasks it was not designed for, potentially exposing company data.

These issues led to customer complaints, potential compliance violations, and increased security risk.

While quantum algorithms could potentially enhance LLM performance and security, they also threaten to break current encryption standards. Enterprises must start preparing for post-quantum cryptography to secure their LLM deployments in the future.

Implementing Best Practices
Recognizing the need for a more secure approach, the leadership decided to revamp their LLM deployment strategy using several steps:

Robust Model Training and Selection
- Organizational leadership partnered with a reputable AI enterprise to obtain a pretrained model specifically designed for financialservices.
- The model was fine-tuned using carefully vetted, anonymized customer interaction data.
Secure Infrastructure
- The LLM was deployed in an isolated cloud environment with strict access controls.
- All interactions with the LLM were encrypted end-to-end.
Input Sanitization and Validation
- A preprocessing layer was implemented to filter out potential prompt injection attempts.
- An AI-powered anomaly detection system was put in place to flag unusual input patterns.
Output Filtering and Monitoring
- A content filter was implemented to screen outputs for sensitive information or inappropriate content.
- A dedicated team was assigned to monitor chatbot interactions in real time.
Regular Security Audits and Penetration Testing
- Quarterly security audits were scheduled, including LLM-specific penetration testing.
- An independent cybersecurity firm was engaged for annual comprehensive audits.
Employee Training and Awareness
- All employees underwent mandatory training on the appropriate use of the LLM chatbot.
- Clear usage policies were established and communicated companywide.
Ethical Considerations and Governance
- An AI ethics committee was formed, including representatives from legal, compliance, and customer service departments.
- The committee developed guidelines for responsible AI use and regularly reviewed chatbot performance for potential biases.

Results
Six months after implementing these best practices, the company saw significant improvements:

No further incidents of sensitive data exposure were reported.
Prompt injection attempts were consistently detected and blocked.
Employee misuse of the chatbot decreased by 95%.
Customer satisfaction with the chatbot service increased by 40%.
The company passed a regulatory audit with flying colors, with auditors specifically commending their LLM security measures.

Future Considerations

As LLMs continue to evolve and integrate more deeply into enterprise operations, several emerging trends and challenges warrant attention.

Quantum Computing and LLMs
The advent of quantum computing poses both opportunities and threats for LLM security. Quantum algorithms could potentially enhance LLM performance and security in several ways:

Faster training—;Quantum algorithms could dramatically speed up the training process for LLMs, allowing for more complex models to be developed in shorter timeframes.
Enhanced cryptography—Quantum key distribution could provide unbreakable encryption for LLM data and communications.
Improved optimization—Quantum algorithms could help find more optimal solutions for LLM architectures and hyperparameters.
More efficient data processing—Quantum computing could enable more efficient processing of high-dimensional data, potentially leading to more powerful and accurate LLMs.

Federated Learning for Enhanced Privacy
Federated learning techniques, where models are trained across multiple decentralized devices or servers holding local data samples, could provide a solution to some of the privacy concerns associated with LLMs. This approach allows for model improvement without centralizing sensitive data, potentially reducing the risk of large-scale data breaches.¹⁵

Explainable AI in LLMs
As regulatory scrutiny increases, the need for explainable AI (XAI) in LLMs will become more pressing. This is particularly important given the black box nature of many current LLM models. Black box AI models, including many LLMs, make decisions or predictions without providing clear insight into their internal workings or decision-making processes. This lack of transparency can be problematic in several ways:

Lack of accountability—When decisions are made by black box models, it is difficult to assign responsibility or understand why errors occur.
Potential for bias—Hidden biases in the training data or model architecture can lead to unfair or discriminatory outcomes without clear ways to identify or correct them.
Regulatory compliance—Many industries require explainable decision-making processes, which black box models struggle to provide.
User trust—Without understanding how AI systems make decisions, users may be hesitant to rely on them, especially for critical applications.

To address these issues, future LLM deployments may need to provide clear explanations for their outputs, especially in high-stakes domains such as finance or healthcare. This could involve developing new techniques for interpreting and explaining LLM decision-making processes, such as:

Attention visualization—Highlighting which parts of the input the model focused on to generate its output
Counterfactual explanations—Showing how the output would change if certain inputs were different
Natural language explanations—Training models to generate human-readable explanations alongside their primary outputs

This could involve developing new techniques for interpreting and explaining LLM decision-making processes.¹⁶

Adaptive Security Measures
The rapid pace of AI advancement requires more than mere static security measures. Future LLM security strategies will likely call for adaptive, AI-powered cybersecurity measures that can evolve to counter emerging AI-driven threats. As highlighted recently, AI-powered cybersecurity tools are becoming essential to combat sophisticated AI-driven cyberattacks.¹⁷

These adaptive measures may include:

Real-time threat detection—AI that can identify and respond to new types of attacks as they emerge
Automated patch generation—AI that can automatically generate and deploy security patches for newly discovered vulnerabilities
Predictive defense—AI that can predict potential future attack vectors and preemptively strengthen defenses

Regulatory Landscape
As governments and international bodies grapple with the implications of AI and LLMs, we can expect to see a more complex regulatory environment. Enterprises will need to stay agile and ready to adapt their LLM deployments to comply with evolving legal and ethical standards.¹⁸

Conclusion

LLMs represent a significant leap forward in AI capability, offering enterprises powerful tools for innovation and efficiency. However, their deployment also introduces unique security challenges that must be addressed proactively and comprehensively.

The best practices outlined here–from robust model training and secure infrastructure to ethical governance and regular auditing–provide a framework for secure LLM deployment. However, organizations must recognize that these practices are not a one-time implementation, but rather an ongoing process. The fictional case study illustrates that successful LLM security requires continuous monitoring, adaptation, and improvement.

Moreover, looking to the future, it is clear that the landscape of LLM security will continue to evolve. Emerging technologies such as quantum computing and federated learning will introduce new challenges and opportunities. Regulatory changes will demand greater transparency and explainability from AI systems. In this dynamic environment, enterprises must remain vigilant and adaptable.

Ultimately, the key to successful and secure LLM deployment lies in striking a balance between innovation and caution. By embracing the potential of LLMs while rigorously addressing their security implications, enterprises can harness these powerful tools to drive growth and efficiency without compromising on security and ethics.

As digital trust professionals, our role is to lead the way in this new frontier, ensuring that as our AI systems grow more powerful, our security measures evolve to match. The future of AI is bright, but only if we light the way with robust, thoughtful, and adaptive security practices.

Endnotes

¹ OpenAI, “Constitutional AI: Harmlessness from AI Feedback,” arXiv, 2022
² Wan, A.; Wallace, E.; et al.; “Poisoning Language Models During Instruction Tuning,” arXiv, 1 May 2023
³ Carlini, N.; et al.; “Extracting Training Data from Large Language Models,” USENIX Security Symposium, 2021
⁴ Wang, J.; Liu, Z.; et al.; “Adversarial Demonstration Attacks on Large Language Models,” arXiv, 24 May 2023
⁵ Gurman, M.; “Samsung Bans Use of Generative AI Tools Like ChatGPT After Leak,” Bloomberg, 2 May 2023
⁶ GDPR.EU, “General Data Protection Regulation (GDPR),”
⁷ Victor, D.; “Microsoft Created a Twitter Bot to Learn From Users. It Quickly Became a Racist Jerk,” The New York Times, 24 March 2016
⁸ U.S. Department of Health & Human Services, “Health Information Privacy,” USA
⁹ IBM, Cost of a Data Breach Report 2024, 2024
¹⁰ Goodfellow, I.J.; et al.; “Explaining and Harnessing Adversarial Examples,” arXiv, 20 December 2014
¹¹ Hendrycks, D.; et al.; “Natural Adversarial Examples,” arXiv, 16 July 2019
¹² Dwork, C.; Roth, A.; “The Algorithmic Foundations of Differential Privacy,” Foundations and Trends in Theoretical Computer Science, vol. 9, 2014, p. 211. 407
¹³ Institute of Electrical and Electronics Engineers (IEEE), “The IEEE Global Initiative on Ethics of Autonomous and Intelligent Systems,” 2019
¹⁴ National Institute of Standards and Technology, “Post-Quantum Cryptography,” USA, 2022
¹⁵ McMahan, H.B.; et al.; Communication-Efficient Learning of Deep Networks from Decentralized Data,” arXiv, 2016
¹⁶ Gunning, D.; et al.; “XAI—Explainable Artificial Intelligence,” Science Robotics, 18 December 2019
¹⁷ Muraleedhara, P.; “The Need for AI-Powered Cybersecurity to Tackle AI-Driven Cyberattacks,” ISACA^® Now Blog, 23 April 2024
¹⁸ European Commission, “Proposal for a Regulation Laying Down Harmonised Rules on Artificial Intelligence,” European Union

VAIBHAV MALIK | ISC2 CC

Is a Global Partner Solution Architect at Cloudflare, where he works withglobal partners to design and implement effective security solutions for their customers. With more than 12 years of experience in networking andsecurity, Malik is a recognized industry thought leader and expert in zero trust security architecture.

Home / Resources / ISACA Journal / Issues / 2024 / Volume 6 / Securing LLMs