Basics of Prompt Engineering¶
This work is licensed under a Creative Commons Attribution 4.0 International License.
CyVerse Full Prompt Engineering Workshop
Managing API keys¶
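However you configure your extension, keep keys out of source files and version control. A minimal sketch of loading a key from an environment variable instead of hardcoding it (the variable name `GEMINI_API_KEY` is just an example):

```python
import os

def load_api_key(var_name: str) -> str:
    """Read an API key from an environment variable, failing loudly if unset."""
    key = os.environ.get(var_name)
    if not key:
        raise RuntimeError(
            f"{var_name} is not set; export it in your shell, "
            f"e.g. `export {var_name}=...`"
        )
    return key
```

The same key can then be pasted into the extension's settings field or read by your own scripts, without ever being committed to a repository.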
Extension Installation¶
- Open VS Code.
- Navigate to the Extensions view by clicking the icon in the Activity Bar on the side of the window or by pressing `Ctrl+Shift+X` (Windows/Linux) or `Cmd+Shift+X` (macOS).
- In the search bar, type "Roo Code" (or "Cline" if Roo Code isn't listed and you're using vanilla Cline).
- Find the official extension in the search results and click Install.
- Once installed, you might need to reload VS Code if prompted.
Selecting an API¶
After installation, you'll typically need to configure an LLM API endpoint and key. Look for settings related to Roo Code or Cline in VS Code's settings (`Ctrl+,` or `Cmd+,`).
Google Gemini¶
- Obtain your Google Gemini API key from Google AI Studio or Google Cloud Console.
- In VS Code settings, search for "Roo Code Gemini" or a similar setting.
- Enter your API key in the designated field (e.g., `Roo Code: Gemini API Key`).
- You might also need to specify the model (e.g., `gemini-pro`).
Ollama (for Local Models)¶
Ollama allows you to run open-source LLMs locally.
- Ensure Ollama is installed and running on your machine, with the desired models downloaded (e.g., `ollama pull gemma:2b`).
- In the VS Code settings for Roo Code/Cline, look for an option to specify the Ollama API endpoint. This is `http://localhost:11434` by default.
- Select or specify the Ollama model you wish to use (e.g., `gemma`, `llama3`, `codellama`).

No API key is typically needed for local Ollama usage, but the extension must be configured to point to the local server.
OpenAI Compatible¶
This is for services that adhere to the OpenAI API specification, which can include OpenAI itself or other providers like Azure OpenAI or local LLM servers.
- Obtain your API key and API base URL (endpoint) from your provider.
    - For OpenAI: key from platform.openai.com; the endpoint is typically `https://api.openai.com/v1`.
    - For Azure OpenAI: key and endpoint from your Azure deployment.
    - For others: refer to your provider's documentation.
- In VS Code settings for Roo Code/Cline:
    - Enter the API key (e.g., `Roo Code: OpenAI API Key`).
    - Enter the API base URL if it differs from the default (e.g., `Roo Code: OpenAI API Base URL`).
    - Select the desired model (e.g., `gpt-4o`, `gpt-3.5-turbo`).
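All OpenAI-compatible providers accept the same chat-completions request shape, which is why a single settings screen covers them all. A sketch of that JSON body (the base URL and model name are placeholders for your provider's values; we only construct the body, not send it):

```python
import json

# POST <base_url>/chat/completions with a body of this shape.
base_url = "https://api.openai.com/v1"  # swap in your provider's endpoint
request_body = {
    "model": "gpt-4o",  # or whatever model your provider exposes
    "messages": [
        {"role": "system", "content": "You are a concise coding assistant."},
        {"role": "user", "content": "Summarize this function in one line."},
    ],
}
encoded = json.dumps(request_body)
```

Because only `base_url`, the key, and the model name change between providers, the same client code works against OpenAI, Azure OpenAI, or a local server that speaks the same protocol.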
Claude (via API)¶
If Roo Code/Cline supports direct Claude API integration (distinct from the Claude Desktop app):
- Obtain your Anthropic API key from the Anthropic Console.
- In VS Code settings for Roo Code/Cline, search for "Roo Code Claude" or a similar setting.
- Enter your API key (e.g., `Roo Code: Claude API Key`).
- Specify the Claude model you wish to use (e.g., `claude-3-opus-20240229`).
Restart for Changes
After changing API settings, it's often a good idea to restart VS Code or the extension itself if it provides such an option, to ensure the new settings take effect.
Setting up GitHub Copilot¶
GitHub Copilot is deeply integrated into the GitHub ecosystem and VS Code (local).
In GitHub CodeSpaces¶
- Enable Copilot for your account: Ensure you have an active GitHub Copilot subscription associated with your GitHub account.
- Launch a CodeSpace: When you create or open a repository in GitHub CodeSpaces, Copilot is often enabled by default if your account has access.
- Check Status: Look for the Copilot icon in the status bar at the bottom of the VS Code interface within CodeSpaces. If it's not active, click it to see options or troubleshoot. You might need to authorize it for the specific CodeSpace.
Extension Installation in VS Code (Desktop)¶
- Open VS Code.
- Navigate to the Extensions view (`Ctrl+Shift+X` / `Cmd+Shift+X`).
- Search for "GitHub Copilot".
- Find the official extension by GitHub and click Install.
- Sign In: After installation, VS Code will prompt you to sign in with your GitHub account. Follow the prompts to authorize VS Code to use GitHub Copilot.
    - If you're not prompted, you can often click the user icon in the bottom left of VS Code and sign in there, or find a "Sign In to GitHub Copilot" command in the Command Palette (`Ctrl+Shift+P` or `Cmd+Shift+P`).
- Once signed in and with an active subscription, Copilot will be ready to assist you. You'll see its icon in the status bar.
Vibe Coding¶
Vibe coding refers to using an LLM to generate and edit code directly within your IDE (e.g., VS Code). This approach allows for a more fluid and interactive coding experience, where the LLM acts as a collaborative partner.
Allowing an LLM to execute code on your computer may violate institutional security and privacy policy
Coding tools like Cline and Windsurf give you the option to allow 'execution' of code on your machine.
You must understand the implications of giving these LLMs the authority to execute code on your computer and on the network it runs on.
Malicious code lives on the internet, and your vibing LLM might install it while you're not paying attention
Read more: Vibe Check: False Packages, A New LLM Security Risk
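One simple mitigation is to require that every package an agent proposes to install be checked against an allowlist you maintain yourself, so a typo-squatted or hallucinated package name gets flagged before `pip install` ever runs. A sketch (the allowlist contents here are purely illustrative):

```python
# Hypothetical allowlist; maintain your own based on packages you have vetted.
APPROVED = {"numpy", "pandas", "scikit-learn", "matplotlib", "requests"}

def vet_install(packages: list[str]) -> list[str]:
    """Return the packages NOT on the allowlist, so a human can review them
    before any install command is allowed to run."""
    return [p for p in packages if p.lower() not in APPROVED]

vet_install(["pandas", "reqeusts"])  # note the typo-squatted name
# → ['reqeusts']
```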
Vibe Coding Platforms¶
The platforms below differ along several axes: editor integration (VS Code, GitHub CodeSpaces), operating system (Apple OS, Windows), command line interface support, open source vs. licensed, and API-based billing.
- Aider A popular command-line tool for AI-driven coding, often used with local or remote LLMs.
- Claude Desktop An easy-to-install desktop platform that connects to Anthropic's powerful LLM API, and allows you to connect to MCP servers.
- Cursor A popular standalone fork of VS Code, focused on integrating new models with stability and offering a flat-fee pricing model.
- GitHub Copilot Integrated with VS Code and GitHub CodeSpaces, provides agentic coding with periodic performance fluctuations and tiered pricing.
- Cline Open-source and model-agnostic, pioneering features like “bring your own model” (BYOM) and operating on a per-request billing structure.
- Roo Code Derived from Cline, prioritizes rapid feature development and customization, serving users interested in experimental capabilities.
- Windsurf Offers similar agentic and inline features with tiered pricing and a “just works” usability orientation.
Introduction to Prompt Engineering¶
Prompt engineering is the technique of crafting effective instructions for AI large language models. With modern AI-powered tools like Claude Desktop, ChatGPT, Gemini, and NotebookLM offering capabilities to upload documents, search the web, and process multiple file types, mastering prompt engineering has become essential for productive AI interactions.
What You'll Learn
- Fundamentals: How AI models process and respond to prompts
- Modern Features: Leveraging document uploads, web search, and multi-modal inputs
- Best Practices: Structured approaches to writing effective prompts
- Advanced Techniques: Context management, chaining, and custom instructions
Understanding Modern AI Capabilities¶
Core Features of Today's AI Tools¶
Modern AI assistants have evolved beyond simple text chat:
| Feature | Claude | ChatGPT | Gemini | NotebookLM | Copilot |
|---|---|---|---|---|---|
| Document Upload | PDFs, text, code | PDFs, images, data | PDFs, images, GDrive | PDFs, Google Docs | PDFs, OneDrive |
| Web Search | Via MCP | Yes | Yes | Yes | Yes |
| Context Window (tokens) | 200K | 128K | 2M | Document-based | 128K |
| File Analysis | Yes | Yes | Yes | Deep analysis | Yes |
| Code Execution | Yes (MCP) | Yes | Yes | No | Yes |
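Context windows are measured in tokens, not characters. A rough rule of thumb for English text is about four characters per token, which is enough for back-of-the-envelope checks on whether a document will fit. A sketch of that heuristic (real tokenizers will give different counts):

```python
def estimate_tokens(text: str) -> int:
    """Very rough token estimate: English prose averages ~4 characters per
    token. Real tokenizers (BPE, SentencePiece) will differ, so treat this
    only as a sanity check against a model's context window."""
    return max(1, len(text) // 4)

doc = "word " * 1000          # 5,000 characters
estimate_tokens(doc)          # → 1250 tokens, well within a 128K window
```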
How AI Models Process Your Input¶
The Processing Pipeline
- Tokenization: Your prompt is broken into smaller units (tokens)
- Context Assembly: Uploaded documents and conversation history are included
- Attention Mechanism: The model identifies relevant information
- Generation: Response is produced token by token
- Formatting: Output is structured according to your specifications
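To make step 1 concrete, here is a toy tokenizer that splits a prompt into words and punctuation. Production models use learned subword tokenizers such as BPE, so real token boundaries and counts will differ; this only illustrates the idea of breaking text into units:

```python
import re

def toy_tokenize(prompt: str) -> list[str]:
    """Illustrative only: split on word runs and single punctuation marks.
    Real models tokenize into learned subword pieces, not whole words."""
    return re.findall(r"\w+|[^\w\s]", prompt)

toy_tokenize("Summarize this paper in 3 bullet points.")
# → ['Summarize', 'this', 'paper', 'in', '3', 'bullet', 'points', '.']
```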
Getting Started: Basic Prompt Structure¶
The Foundation: Clear Instructions¶
Start with simple, direct prompts before advancing to complex techniques:
```
# Basic Prompt
"Summarize this research paper in 3 bullet points"

# Better Prompt
"As a research scientist, summarize the key findings from this paper
in 3 bullet points, focusing on methodology and results"

# Best Prompt
"You are a research scientist reviewing papers for a journal.
Summarize the attached PDF in 3 bullet points that cover:
1. Research question and hypothesis
2. Methodology and sample size
3. Key findings and limitations
Format as a bullet list with sub-points for clarity."
```
Working with Documents¶
Modern AI tools excel at document analysis. Here's how to maximize their potential:
Document Upload Best Practices
- Specify the document: "In the attached PDF..." or "Based on the uploaded spreadsheet..."
- Direct attention: "Focus on Section 3.2 of the document"
- Request specific outputs: "Create a table comparing the methods described in chapters 2 and 5"
- Combine multiple sources: "Compare the findings in these three papers"
Example: Multi-Document Analysis¶
```
I've uploaded three research papers on climate change. Please:
1. Create a comparison table with columns for:
   - Paper title and authors
   - Methodology
   - Key findings
   - Limitations
2. Identify common themes across all papers
3. Highlight any contradictory findings
Format the response with clear headers and use markdown tables.
```
The CRAFT Framework¶
For consistent, high-quality results, use the CRAFT framework:
Context¶
Provide background information and set the scene
Role¶
Define who the AI should act as
Action¶
Specify exactly what you want done
Format¶
Describe how the output should be structured
Tone¶
Indicate the style and voice to use
CRAFT Example¶
```
Context: I'm preparing a grant proposal for NSF funding on AI in education
Role: Act as an experienced grant writer and education researcher
Action: Review my draft introduction and suggest improvements
Format: Provide feedback as tracked changes with explanations
Tone: Professional, constructive, and encouraging
```
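If you use CRAFT often, it can help to template it. A sketch of a small helper (hypothetical, not part of any platform) that assembles the five elements into one labeled prompt block:

```python
def craft_prompt(context: str, role: str, action: str, fmt: str, tone: str) -> str:
    """Assemble the five CRAFT elements into a single labeled prompt."""
    return "\n".join([
        f"Context: {context}",
        f"Role: {role}",
        f"Action: {action}",
        f"Format: {fmt}",
        f"Tone: {tone}",
    ])

print(craft_prompt(
    "Preparing an NSF grant proposal on AI in education",
    "Experienced grant writer and education researcher",
    "Review my draft introduction and suggest improvements",
    "Feedback as tracked changes with explanations",
    "Professional, constructive, and encouraging",
))
```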
Advanced Techniques¶
1. Custom Instructions and System Prompts¶
Modern AI platforms allow you to set persistent instructions:
'Custom Instructions' or 'System Instructions'
Platforms like Gemini and Claude allow you to add "Custom Instructions" or "System Instructions" ahead of your prompts; these act as a global rule applied to every subsequent prompt in the conversation.
For example:
```
# Project Context
I'm a data scientist working on machine learning projects.
Always provide Python code examples using scikit-learn and pandas.
Include docstrings and type hints in all code.

# Response Preferences
- Be concise but thorough
- Explain complex concepts with analogies
- Always cite sources when making factual claims
```
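In API terms, custom instructions typically map onto a system message that is prepended to every user turn. A sketch of that structure, reusing the project context above (the message-list shape follows the common chat-completions convention):

```python
SYSTEM_INSTRUCTIONS = (
    "I'm a data scientist working on machine learning projects. "
    "Always provide Python code examples using scikit-learn and pandas. "
    "Include docstrings and type hints in all code."
)

def build_messages(user_prompt: str) -> list[dict]:
    """Prepend the persistent system instructions to each user turn; this is
    roughly what 'custom instructions' do behind the scenes."""
    return [
        {"role": "system", "content": SYSTEM_INSTRUCTIONS},
        {"role": "user", "content": user_prompt},
    ]
```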
2. Leveraging Web Search¶
Most mainstream AI assistants now offer a web browse or search capability. Enabling search allows the model to retrieve and read websites and PDFs while reasoning out its response.
```
Search for the latest research on the public health benefits of vaccination published in 2024.
Focus on:
- Papers from top conferences (AHA, ASPPH, NRHA, ICFMDP)
- mRNA
- Bird Flu and COVID
Summarize the top 5 papers with links to the originals.
```
3. Multi-Modal Prompting¶
Combine different input types for richer interactions:
```
I've uploaded:
1. A screenshot of my dashboard
2. The underlying data in CSV format
3. Our brand guidelines PDF

Create a redesigned dashboard that:
- Improves data visualization based on best practices
- Adheres to our brand colors and fonts
- Highlights the KPIs mentioned in the data dictionary
```
4. Prompt Chaining¶
Build complex outputs through sequential prompts:
Effective Chaining Strategy
- Start broad: "Outline a research paper on sustainable AI"
- Zoom in: "Expand section 3 on energy-efficient training methods"
- Refine: "Add citations and make the tone more academic"
- Polish: "Format according to IEEE standards"
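Programmatically, chaining is just a loop that feeds each response into the next prompt. A sketch with a stand-in `llm` callable (the stub here only echoes; replace it with a real API client):

```python
def run_chain(llm, steps: list[str]) -> str:
    """Run a sequence of prompts, feeding each step's output into the next.
    `llm` is any callable taking a prompt string and returning a response."""
    result = ""
    for step in steps:
        prompt = f"{step}\n\nPrevious output:\n{result}" if result else step
        result = llm(prompt)
    return result

# Stub LLM for illustration only; a real client would call an API here.
echo_llm = lambda prompt: f"[response to: {prompt.splitlines()[0]}]"

run_chain(echo_llm, [
    "Outline a research paper on sustainable AI",
    "Expand section 3 on energy-efficient training methods",
    "Add citations and make the tone more academic",
])
# → '[response to: Add citations and make the tone more academic]'
```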
5. Using Examples (Few-Shot Learning)¶
Provide examples to guide the AI's output:
```
I need to classify customer feedback. Here are examples:

"The product arrived damaged" → Category: Shipping Issue
"Can't log into my account" → Category: Technical Support
"Love the new features!" → Category: Positive Feedback

Now classify these:
1. "The app keeps crashing on startup"
2. "Best purchase I've made this year"
3. "Package was left in the rain"
```
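When sending few-shot prompts from code, it helps to keep the labeled examples as data and assemble the prompt text from them, so examples can be added or swapped without rewriting the prompt. A sketch using the categories above:

```python
# Labeled examples kept as data, separate from the prompt template.
EXAMPLES = [
    ("The product arrived damaged", "Shipping Issue"),
    ("Can't log into my account", "Technical Support"),
    ("Love the new features!", "Positive Feedback"),
]

def few_shot_prompt(new_items: list[str]) -> str:
    """Assemble labeled examples plus unlabeled items into one few-shot prompt."""
    lines = ["I need to classify customer feedback. Here are examples:", ""]
    lines += [f'"{text}" -> Category: {label}' for text, label in EXAMPLES]
    lines += ["", "Now classify these:"]
    lines += [f'{i}. "{item}"' for i, item in enumerate(new_items, 1)]
    return "\n".join(lines)
```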
Practical Applications¶
Research and Analysis¶
```
Analyze the attached dataset (CSV) and:
1. Identify statistical patterns and outliers
2. Create visualizations for the top 3 insights
3. Write a methods section describing the analysis
4. Suggest additional analyses based on the data

Use pandas profiling techniques and create matplotlib visualizations.
Include code that I can run locally.
```
Writing and Editing¶
```
I've uploaded my draft manuscript. Please:
1. Check for consistency in terminology throughout
2. Ensure all figures are referenced in the text
3. Verify the citation format matches APA 7th edition
4. Highlight any unclear passages
5. Suggest improvements for flow between sections

Provide a tracked-changes version and a summary of major edits.
```
Code Development¶
```
Based on the uploaded requirements document:
1. Create a Python class structure for the described system
2. Include comprehensive docstrings and type hints
3. Add unit tests for each method
4. Create a README with installation and usage instructions
5. Follow PEP 8 style guidelines

Use modern Python features (3.10+) and include error handling.
```
Common Pitfalls and Solutions¶
Pitfall 1: Vague Instructions¶
❌ Poor: "Make this better"
✅ Better: "Improve this abstract by making it more concise (under 250 words), adding keywords, and ensuring it follows the journal's structure: background, methods, results, conclusions"
Pitfall 2: Information Overload¶
❌ Poor: Uploading 50 documents without guidance
✅ Better: "Focus on documents 1-3 which contain the methodology. Ignore the appendices."
Pitfall 3: Assuming Knowledge¶
❌ Poor: "Fix the usual issues"
✅ Better: "Check for: passive voice, sentences over 25 words, undefined acronyms, and missing Oxford commas"
Pitfall 4: No Output Format¶
❌ Poor: "Summarize this"
✅ Better: "Create an executive summary with: - 3-sentence overview - 5 key points as bullets - 1 paragraph on implications - Formatted with markdown headers"
Quick Reference Card¶
Prompt Engineering Checklist
- Clear objective: What do you want to achieve?
- Context provided: Background information included?
- Role defined: Who should the AI act as?
- Specific action: Exact task described?
- Output format: Structure specified?
- Examples given: For complex tasks?
- Constraints noted: Length, style, or content limits?
- Documents referenced: If using uploads?
- Follow-up planned: For iterative improvement?
Assessment Questions¶
How do modern AI tools handle uploaded documents?
Answer
Modern AI tools process uploaded documents by:

- Converting them to text (OCR for images/PDFs)
- Adding them to the context window
- Allowing specific references ("In section 2.3...")
- Enabling cross-document analysis
- Maintaining document structure awareness
What's the most important element of an effective prompt?
Answer
Clarity of instruction is paramount. The AI needs to understand:

- What you want done (action)
- How you want it done (format)
- Why you want it done (context)
Without clear instructions, even the most advanced AI will produce suboptimal results.
How can you ensure consistent outputs across multiple sessions?
Answer
- Use custom instructions (ChatGPT, Claude) or system prompts
- Create templates for common tasks
- Save successful prompts for reuse
- Use platform features like GPTs or Projects
- Include examples in your prompts
- Specify exact formats with templates
True or False: Longer prompts always produce better results
False
Prompt quality matters more than length. A well-structured, concise prompt often outperforms a lengthy, unfocused one. However, providing sufficient context and clear instructions is important. Aim for:
- Completeness over brevity
- Clarity over complexity
- Structure over stream-of-consciousness