Knowledge Extraction System - A Template for Security Architect's Workflow Augmentation Leveraging LLM.

1.0: Problem Statement.

In the most basic sense, LLMs are a useful piece of technology for encoding information. Trained on vast datasets to understand and generate human-like text, they excel in extracting and synthesizing knowledge from diverse sources, making them powerful for information retrieval. They are a useful tool for augmenting day-to-day tasks or workflows when used right. However, LLMs can struggle with providing precise and relevant information within user prompts due to potential biases, overfitting, and lack of specific context. Privacy concerns arise as LLMs may inadvertently retain and expose sensitive data from their training sets, risking user confidentiality and data security. When organizations use third-party LLM API, there is also a risk of providing proprietary information to the model that can later be leaked.

While LLMs offer broad applicability across domains, their adoption in enterprise environments, particularly for augmenting a security architect's workflow, has been limited due to the abovementioned challenges. The proposed Knowledge Extraction System template demonstrates how LLMs can be leveraged to scale a Security Architect's workflow by utilizing concepts and technologies like RAG, prompt engineering, and controlled LLM deployment, thereby addressing the previously identified challenges

2.0: Solution.

This project aims to provide a template for the adoption of AI, specifically LLMs, for various enterprise tasks especially the workflow of a security architect. Key strategies used include:

Retrieval-Augmented Generation (RAG): RAG was leveraged to create augmented prompts that provide better context for LLM prompts, hence, improving the relevance and accuracy of the generated output. In the security use case for example, the RAG could ingest an organization's product security requirements, GRC policies and requirements from security frameworks like NIST and PCI DSS. The data from this ingested data will serve as context information for the user prompt to the LLM. That way, the final output will be generated from this context and not purely from the LLM's traning data.
Prompt Engineering: Crafted well-structured prompts for each step in a task such that the LLM has clear boundaries, enabling them to operate within defined parameters and produce more relevant results. When combined with AI agents, well-engineered prompts drove efficient task completion.
Privacy Measures: Privacy concerns can be mitigated by deploying LLMs locally within an organization, ensuring that the models are governed by the same access controls as other IT systems. Whether deployed on-premises or in a controlled cloud environment, this approach helps safeguard sensitive information. Cloud providers like AWS offer services for such deployment. An example is AWS Bedrock.

By implementing these strategies, enterprises can significantly mitigate (though not entirely eliminate) the challenges associated with LLM adoption, paving the way for more secure and effective use of AI for workflow augmentation.

3.0 System Summary.

The solution developed to address these challenges is code-named the Knowledge Extraction System (KES). KES leverages Retrieval-Augmented Generation (RAG) and sophisticated prompt engineering with AI agents to create a robust template that accomplishes specific tasks while delivering highly relevant outputs. The KES framework simplifies the process of ingesting additional data into RAG and facilitates the creation of AI agents, referred to as "crew," each tailored to perform distinct tasks.

The output from any crew is contextually relevant information, produced through well-crafted prompts that guide each agent's work. This output consolidates data from multiple sources, ensuring that the final result is comprehensive and precise. By utilizing RAG, KES significantly reduces the risk of hallucinations or the LLM relying solely on its training data, making the information it provides more accurate and dependable.

KES is versatile and can be adopted across various domains where there is a need for context-specific, relevant information with minimal risk of inaccuracies.

4.0: System Architecture.

The C4 model was used for the architecture diagram below. They illustrate the different layers and complexities of the application in stages. Level 1 presents a high-level overview, while subsequent levels (Level 2) delve into specific components/services to provide more detailed information.

4.1: Level 1 - System Context.

This provide a high level view of the system and its components.

4.2: Level 2 - Data Processing Service. Container

This service extracts information from all enterprise sources or knowledge bases. As data is placed in an object store, an event that places the data in a queue for processing is fired. The data pre-processor takes the data off the queue and processes it according to the data type (document, picture, audio, or video). NOTE: For Audio & Video, the transcript is the input in this service.

4.3: Level 2 - RAG Management Service Container.

The data from the Data Processing Service is sent here. The data is first chunked according to a pre-determined chunking metric. Embedding is created from this chunk and saved in a vector database (Postgres with PG Vector in the current case).

4.4: Level 2 - Knowledge Retrieval Service Container.

The Knowledge Retrieval Service uses augmented prompts with specialized AI agents to get the final output for a task.

4.4.1: UI Layout

5.0: Use Cases

As previously mentioned, KES can be used in any domain for any task that requires information with a reasonable amount of context and fewer generated ideas. Below are some use cases developed or currently being explored:

Use Case 1: Security Architect KES:

A security architect performs many tasks that contribute to the security of enterprise software systems. Some of those include Security Review, Code Review, and Threat Modeling. To properly perform these tasks, the Security Architect needs a detailed context of the application being reviewed in various dimensions. The use of "various dimensions" here means all the information the Security Architect needs to gather to perform their tasks. This includes:

System Understanding
System Components Information
Data Dictionary
Trust Boundaries
Threat Scenarios
Countermeasures

To use KES to augment some tasks of a Security Architect, one needs to be able to provide the LLM with the following contextual information:

Initial prompt: This will be in the form of an architectural diagram or detailed system description.
Enterprise Security Requirements: These are the high-level security requirements defined by the product security team. These generic security requirements serve to guide security reviews in various domains. These will include Data Transfer Security Requirements, Network Security Requirements, Cloud Security Requirements, IAM Security Requirements, etc. Each of these is organization and (sometimes product) specific.
Governance, Risk, and Compliance (GRC) Policies: These are policies defined by the GRC team to provide governance and compliance guidance. By providing the Enterprise Security requirement and GRC policies through RAG and utilizing an AI agent to orchestrate the steps for each task, one can easily scale some of a security architect's tasks.

Threat Modeling Test Artifacts:

Threat Modeling Augmentation Demo Video

Threat Modeling Augmentation Output File

Other use cases for this template exist outside security. One of those is Real Estate Analysis workflow.

gideonaina/kes.md