Vertex AI RAG with Gemini 2 Flash

A complete implementation of Retrieval-Augmented Generation (RAG) workflow using Google Cloud's Vertex AI and Gemini 2.0 Flash.

Step-by-Step RAG Workflow

Create and manage RAG corpus in Vertex AI
Import documents from Google Cloud Storage
Configure chunking and embedding models
Perform direct context retrieval
Generate AI responses enhanced with document context using Gemini models
Clean up RAG corpora after demo

Requirements

Google Cloud account with Vertex AI access
Python 3.8+
Required Python packages (see requirements.txt)

Setup

Enable required Google Cloud services:

gcloud services enable aiplatform.googleapis.com --project=PROJECT_ID
gcloud services enable storage.googleapis.com --project=PROJECT_ID

Set up IAM permissions:

gcloud projects add-iam-policy-binding PROJECT_ID --member="user:YOUR_EMAIL@domain.com" --role="roles/aiplatform.user" 
gcloud projects add-iam-policy-binding PROJECT_ID --member="user:YOUR_EMAIL@domain.com" --role="roles/storage.objectAdmin"

Create a Google Cloud Storage bucket and upload your PDF files:

# Create a new GCS bucket (skip if you already have one)
gsutil mb -l us-central1 gs://your-bucket-name

# Upload PDF files to the bucket
gsutil cp your-document.pdf gs://your-bucket-name/

# Verify files were uploaded successfully
gsutil ls gs://your-bucket-name/

Note: Remember the path to your files (e.g., gs://your-bucket-name/your-document.pdf) as you'll need it for the corpus creation.

Set up authentication credentials:

# Download your service account key from GCP Console and set the environment variable
export GOOGLE_APPLICATION_CREDENTIALS=/path/to/your/service-account-key

# On Windows, use:
# set GOOGLE_APPLICATION_CREDENTIALS=C:\path\to\your\service-account-key

Note: This step is critical for authenticating with Google Cloud services. You need a service account with the appropriate permissions.

Clone this repository:

git clone https://github.com/arjunprabhulal/vertex-ai-rag.git
cd vertex-ai-rag

Install dependencies:

pip install -r requirements.txt
pip install --upgrade google-cloud-aiplatform

Update the configuration in vertex_ai_rag.py:
- Set your PROJECT_ID
- Configure your corpus display name
- Add paths to your documents in Google Cloud Storage:
```
paths = ["gs://your-bucket-name/your-document.pdf"] 
```

Usage

Run the main script:

python vertex_ai_rag.py

Documentation

For more information on Vertex AI RAG, see the official Google Cloud documentation.

Author

For more articles on AI/ML and Generative AI, follow me on Medium: https://medium.com/@arjun-prabhulal

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
images		images
README.md		README.md
requirements.txt		requirements.txt
vertex_ai_rag.py		vertex_ai_rag.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vertex AI RAG with Gemini 2 Flash

Step-by-Step RAG Workflow

Requirements

Setup

Usage

Documentation

Author

About

Releases

Packages

Languages

arjunprabhulal/vertex-ai-rag

Folders and files

Latest commit

History

Repository files navigation

Vertex AI RAG with Gemini 2 Flash

Step-by-Step RAG Workflow

Requirements

Setup

Usage

Documentation

Author

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages