Customising LLMs#

Frameworks#

Framework	Code requirements	Description
LangChain	Pro-code
Dust	??Low-code	GUI for configuring and chaining blocks
Steamship	??	Combine prompts, prompt-chains, and Python code and combine into a managed API
Retune		Simpler, focus on prompt and chatbot session management & creating fine-tuned models

Other tools#

tiktoken - tokenise and quantify prompts, from OpenAI
llamabot - class hierarchy for creating bots github

I think llamabot can help facilitate experimentation and prototyping by making some repetitive things invisible.

Examples#

Getting started with LangChain
Developer-first guide to LLM APIs - blog summary example implemented with OpenAI Python API, LangChain and LlamaIndex API.

The short answer is that we'll see more customization to local contexts, with highest ROI use cases being prioritized.

LLM Embedding and Fine Tuning #

Both fine-tuning and embeddings have challenges.

Fine-tuning concentrates on teaching the model new tasks via transfer learning, while semantic embeddings involve converting the text's meaning into a numerical representation, which can be employed in tasks such as semantic search and information retrieval

Summary#

Fine-tuning GPT-3/3.5/4

Teaches new tasks or patterns
Originally created for image models, now applies to NLP tasks
Used for classification, sentiment analysis, and named entity recognition
Does not teach new information, only new tasks
Prone to confabulation and hallucination
Expensive, slow, and difficult to implement
Not scalable for large datasets

Embedding & Semantic Search

Also known as neural search or vector search
Adds to the LLMs knowledge base
Uses semantic embeddings to represent text meaning
Scales well, fast, and cost-effective
Searches based on context and topic, not just keywords
Easily updates with new information
Solves half of the QA problem by retrieving relevant information

Comparing Fine-tuning and Semantic Search Fine-tuning

Slow, difficult, and expensive
Prone to confabulation
Teaches new tasks, not new information
Requires constant retraining
Not ideal for QA tasks

Semantic Search

Fast, easy, and cheap
Recalls exact information
Easy to add new information
Scalable and efficient
Solves half of QA tasks by retrieving relevant documents

Document embeddings #

Provides an overview of a specific process. Breaking it down and pointing to related software that can help. LangChain - a Python project, but also a javascript version - appears to be an attempt to provide a higher order abstraction than rolling your own entirely.

Echoing the idea being first-llm-api-experiments

Say you have a website that has thousands of pages with rich content on financial topics and you want to create a chatbot based on the ChatGPT API that can help users navigate this content. You need a systematic approach to match users’ prompts with the right pages and use the LLM to provide context-aware responses. This is where document embeddings can help.

Presents the following high level model. Planning to use document embeddings to provide the context aware information

Which translates into this specific

1- The user enters a prompt 2- Create the embedding for the user prompt 3- Search the embedding database for the document that is nearest to the prompt embedding 4- Retrieve the actual text of the document 5- Create a new prompt that includes the user’s question as well as the context from the document 6- Give the newly crafted prompt to the language model 7- Return the answer to the user 8- Bonus: provide a link to the document where the user can further obtain information

Document embeddings#

A list of numbers (numerical vector) representing the features of some information.

Process

Generate embeddings from your documents

Options for generating an embedding include
Storing the embeddings.
1. Python associated Faiss
2. Pinecone (online)

Customising LLMs#

Frameworks#

Other tools#

Examples#

LLM Embedding and Fine Tuning#

Summary#

Document embeddings#

Document embeddings#

LLM Embedding and Fine Tuning #

Document embeddings #