More on Prompting Techniques

Prompt Engineering

Mick McQuaid

University of Texas at Austin

03 Mar 2025

Week Six

Agenda

  • Presentations: Kylie, Aditya
  • News
  • Review WhatIAlreadyKnow (Ishwari)
  • eC review
  • m1 review
  • Haystack (again)
  • Techniques
  • Work time

Presentations

News

The Batch

⟨pause to look at this week’s edition⟩

WhatIAlreadyKnow (Ishwari)

eC review

⟨look at the doc⟩

m1 review

  • It’s a colossal blunder to fail to turn in either the qmd or html file
  • I’m going to suggest that almost everyone use Hugging Face datasets instead of Kaggle
    • For example, Hugging Face has 24 Shakespeare datasets
  • Each group should probably focus on one task, such as recommendation (3), question answering (1), or summarization (2) (One group has several tasks)
  • Although you are using different platforms, you will be required to extensively document what you’re doing in a qmd file and an html file for the three remaining milestones
  • If you’re using Jupyter, leave enough time to translate your notebook to quarto
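
Quarto ships a converter for exactly this translation; a minimal sketch, assuming Quarto is installed and your notebook is named `notebook.ipynb` (a placeholder filename):

```shell
# Convert a Jupyter notebook to a Quarto .qmd source file
quarto convert notebook.ipynb         # writes notebook.qmd
# Render the .qmd to the required html file
quarto render notebook.qmd --to html
```

The conversion is mechanical, so leave time to clean up cell output and add the prose documentation the milestones require.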

Haystack

Break into new pairs as specified on Canvas.

Tutorial components

The tutorial shows you how to create a generative question-answering pipeline using the retrieval-augmented generation (RAG) approach with Haystack 2.0. The process involves four main components:

  • SentenceTransformersTextEmbedder for creating an embedding for the user query,
  • InMemoryEmbeddingRetriever for fetching documents relevant to that embedding,
  • PromptBuilder for creating a template prompt, and
  • OpenAIChatGenerator for generating responses.

But first, data!

  • Try the Python fragments from the tutorial on the seven-wonders dataset
  • If that’s successful, try it on your own dataset
  • The fragments are Check dataset validity, List configuration and splits, Preview a dataset, and Get the size of the dataset.
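Those four fragments all query the Hugging Face datasets-server REST API. A minimal sketch of the endpoints involved, written as URL builders (call each with `requests.get(url).json()` when you are online; `your-username/your-dataset` is a placeholder, not a real dataset id):

```python
# URL builders for the Hugging Face datasets-server endpoints
# used by the tutorial's four data fragments.
BASE = "https://datasets-server.huggingface.co"

def is_valid_url(dataset):
    # "Check dataset validity"
    return f"{BASE}/is-valid?dataset={dataset}"

def splits_url(dataset):
    # "List configuration and splits"
    return f"{BASE}/splits?dataset={dataset}"

def preview_url(dataset, config, split):
    # "Preview a dataset" -- returns the first rows
    return f"{BASE}/first-rows?dataset={dataset}&config={config}&split={split}"

def size_url(dataset):
    # "Get the size of the dataset"
    return f"{BASE}/size?dataset={dataset}"

print(is_valid_url("your-username/your-dataset"))
```

Swap in your own dataset id once you have picked one; the same four calls work for any public dataset on the Hub.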

Embed documents

  • Once you have the data, you can embed it using the SentenceTransformersDocumentEmbedder, which creates an embedding for each document in the dataset. (The matching SentenceTransformersTextEmbedder handles the user query later.)
  • For now, let’s use the default.
  • By the way, what is an embedding?
  • In its simplest form, an embedding is a vector of numbers that represents (in this case) a sentence. The vector is produced by a neural network trained so that texts with similar meanings end up as nearby vectors.
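
To make that concrete, here is a toy sketch, using hand-made three-dimensional vectors rather than a trained model, of the one property embeddings are used for here: similar sentences get nearby vectors, measured by cosine similarity.

```python
import math

def cosine(u, v):
    # Cosine similarity: dot product divided by the product of vector lengths.
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v)))

# Pretend embeddings (a real embedder outputs hundreds of dimensions).
emb = {
    "The statue was made of bronze.": [0.9, 0.1, 0.2],
    "The monument was cast in metal.": [0.8, 0.2, 0.3],
    "I had soup for lunch.":           [0.1, 0.9, 0.1],
}

q = emb["The statue was made of bronze."]
for text, v in emb.items():
    print(f"{cosine(q, v):.2f}  {text}")
```

Running this shows the metal-monument sentence scoring far closer to the query than the soup sentence, which is all a retriever needs.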

Start building the RAG pipeline

  • What is RAG, again?
  • RAG stands for retrieval-augmented generation: a retriever finds documents relevant to the query, and a generator produces a response grounded in them.
  • The pipeline generates a response by combining the retrieved documents with the user query.
  • You will use a text embedder for the user’s question that matches the document embedder you used to embed the documents.
  • You will use a retriever to find relevant documents.
  • You will define a prompt template that will be used to generate the response.
  • You will use a chat generator to generate the response.
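
The flow those bullets describe can be sketched end to end as a toy, with keyword overlap standing in for the embedding retriever and a canned function standing in for the chat generator. None of these names are Haystack's API; they only illustrate the shape of the pipeline.

```python
def retrieve(query, documents, top_k=2):
    # Toy retriever: rank documents by word overlap with the query.
    q = set(query.lower().split())
    scored = sorted(documents, key=lambda d: -len(q & set(d.lower().split())))
    return scored[:top_k]

def build_prompt(query, docs):
    # Prompt template: retrieved context first, then the user question.
    context = "\n".join(f"- {d}" for d in docs)
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

def generate(prompt):
    # Stand-in for a chat model: echo the first context line.
    for line in prompt.splitlines():
        if line.startswith("- "):
            return line[2:]
    return "I don't know."

documents = [
    "The Colossus of Rhodes was a bronze statue of the sun god Helios.",
    "The Great Pyramid of Giza is the oldest of the seven wonders.",
]
query = "What was the Rhodes statue made of?"
prompt = build_prompt(query, retrieve(query, documents))
print(generate(prompt))
```

In Haystack, each of these functions becomes a component and the pipeline wires their outputs to the next component's inputs.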

Initialize the pipeline

  • The tutorial has initialization steps for the above components.
  • The steps are:
    • Initialize the text embedder
    • Initialize the retriever
    • Initialize the prompt builder
    • Initialize the chat generator

Build the pipeline

  • The tutorial has a build step for the above components.

Asking a question

  • The tutorial has a run step for the above components.
  • In this case, a single question is asked.
  • The question is: “What does the Rhodes Statue look like?”
  • Some examples are given that could be run in a loop.
  • Your simple interface to the pipeline is probably going to be a simple loop.
  • I’m not going to require any error-handling or user-friendly interface components. You can consider me the only user and I can be given instructions to follow, such as to press Ctrl-D to exit the loop.
  • The interface need not be a web-based interface but can be. You are welcome to ask a model to generate a web-based interface but a simple command-line interface is sufficient.

Other Haystack tutorials

  • Although you are not required to use Haystack, you might find it useful. There are many tutorials on the Haystack website.

The Beginner tutorials, along with Intermediate and Advanced ones, are linked from the same page as the tutorial we just did.

More on Techniques

Last time, we covered few-shot and zero-shot prompting techniques. Now we’ll move on to thought generation techniques and others. Again, we’re referring to Schulhoff et al. (2024).

Thought Generation

  • prompting the model to articulate its ongoing reasoning
  • Chain-of-Thought
  • Zero-Shot Chain-of-Thought
  • Step-Back Prompting
  • Analogical Prompting
  • Thread-of-Thought Prompting
  • Tabular Chain-of-Thought
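
Zero-Shot Chain-of-Thought, for instance, needs no worked examples at all: it just appends a reasoning trigger to the question. A minimal sketch, using the trigger phrase from the literature the survey covers:

```python
def zero_shot_cot(question):
    # Zero-Shot CoT: append a reasoning trigger instead of worked examples.
    return f"Q: {question}\nA: Let's think step by step."

print(zero_shot_cot("If I have 3 apples and eat one, how many remain?"))
```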

Few-Shot CoT

  • multiple examples, including chains-of-thought
  • Contrastive CoT Prompting
  • Uncertainty-Routed CoT Prompting
  • Complexity-based Prompting
  • Active Prompting
  • Memory-of-Thought Prompting
  • Automatic Chain-of-Thought Prompting
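
What distinguishes Few-Shot CoT from the zero-shot variant is that each demonstration carries its own worked reasoning chain. A minimal sketch with one hand-written demonstration (the demo content is illustrative, not from the survey):

```python
def few_shot_cot(question, demos):
    # demos: (question, chain-of-thought, answer) triples shown before the query.
    parts = [f"Q: {q}\nA: {cot} So the answer is {a}." for q, cot, a in demos]
    parts.append(f"Q: {question}\nA:")
    return "\n\n".join(parts)

demos = [("Roger has 5 balls and buys 2 more. How many now?",
          "He starts with 5 and gains 2, and 5 + 2 = 7.", "7")]
print(few_shot_cot("A shelf holds 3 books and I add 4. How many?", demos))
```

The variants listed above mostly differ in how these demonstrations are chosen, ordered, or generated automatically.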

Decomposition

  • explicitly decomposing the problem into subproblems
  • Least-to-Most Prompting
  • Plan-and-Solve Prompting
  • Tree-of-Thought Prompting
  • Recursion-of-Thought Prompting
  • Program-of-Thoughts
  • Faithful Chain-of-Thought
  • Skeleton-of-Thought
  • Metacognitive Prompting

Ensembling

  • using multiple prompts to solve the same problem, then aggregating the results, for example, by majority vote
  • Demonstration Ensembling
  • Mixture of Reasoning Experts
  • Max Mutual Information Method
  • Self-Consistency
  • Universal Self-Consistency
  • Meta-Reasoning over Multiple CoTs
  • DiVeRSe
  • Consistency-based Self-Adaptive Prompting
  • Universal Self-Adaptive Prompting
  • Prompt Paraphrasing
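
The aggregation step shared by several of these (Self-Consistency in particular) is a plain majority vote over the final answers parsed from multiple sampled outputs. A minimal sketch:

```python
from collections import Counter

def majority_vote(answers):
    # Self-consistency aggregation: the most frequent final answer wins.
    return Counter(answers).most_common(1)[0][0]

# Final answers parsed from, say, five sampled chains of thought:
samples = ["7", "7", "8", "7", "6"]
print(majority_vote(samples))  # → 7
```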

Self-Criticism

  • prompting the model to critique its own output
  • Self-Calibration
  • Self-Refine
  • Reversing Chain-of-Thought
  • Self-Verification
  • Chain-of-Verification
  • Cumulative Reasoning

END

References

Schulhoff, Sander, Michael Ilie, Nishant Balepur, Konstantine Kahadze, Amanda Liu, Chenglei Si, Yinheng Li, et al. 2024. “The Prompt Report: A Systematic Survey of Prompting Techniques.” https://arxiv.org/abs/2406.06608.

Colophon

This slideshow was produced using quarto

Fonts are Roboto Light, Roboto Bold, and JetBrains Mono Nerd Font