ARINDAM BANERJEE

Reputation: 689

RAG - Specifying task_type during Question Answering with Vertex AI Embeddings

I'm using Vertex AI embeddings with LangChain for a RAG application. Reference: https://cloud.google.com/blog/products/ai-machine-learning/improve-gen-ai-search-with-vertex-ai-embeddings-and-task-types/

I've created my embeddings using task_type="QUESTION_ANSWERING". However, I can't figure out how to specify the same task_type during the actual question-answering retrieval process. The code I'm using is below:


from langchain_google_vertexai import VertexAIEmbeddings
from langchain.chains import RetrievalQA
from langchain.prompts import PromptTemplate

vertex_embeddings = VertexAIEmbeddings(model_name="text-multilingual-embedding-002")

# Some code to retrieve pgvector vector_store
vector_store = get_pgvector(collection_name)

# Create chain to answer questions
NUMBER_OF_RESULTS         = 1
SEARCH_DISTANCE_THRESHOLD = 0.6

retriever = vector_store.as_retriever(
    search_type="similarity",
    search_kwargs={
        "k": NUMBER_OF_RESULTS,
        "search_distance": SEARCH_DISTANCE_THRESHOLD,
    },
)


qa = RetrievalQA.from_chain_type(
    llm                     = get_llm(),
    chain_type              = "stuff",
    retriever               = retriever,
    return_source_documents = True,
    verbose                 = True,
    chain_type_kwargs       = {
        "prompt": PromptTemplate(
            template        = prompt_template,  # prompt text defined elsewhere
            input_variables = ["context", "question"],
        ),
    },
)

I haven't found any way to pass the task_type to the retrieval step. One workaround is to increase NUMBER_OF_RESULTS and then re-rank the results with sklearn's cosine_similarity, using query embeddings generated with the desired task_type, but this adds unwanted latency.
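For context, the workaround would look roughly like this. This is a sketch, not my exact code: it assumes VertexAIEmbeddings.embed() accepts an embeddings_task_type argument (as in recent versions of langchain_google_vertexai), and it re-embeds the retrieved chunks, which is where the extra latency comes from:

import numpy as np
from sklearn.metrics.pairwise import cosine_similarity

# Over-fetch (larger k), then re-rank against a QUESTION_ANSWERING query embedding.
docs = retriever.invoke(question)
query_vec = vertex_embeddings.embed(
    [question], embeddings_task_type="QUESTION_ANSWERING"
)[0]
doc_vecs = vertex_embeddings.embed(
    [d.page_content for d in docs], embeddings_task_type="RETRIEVAL_DOCUMENT"
)
scores = cosine_similarity([query_vec], doc_vecs)[0]
best_doc = docs[int(np.argmax(scores))]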

Is there a way to directly specify the task_type during retrieval with langchain_google_vertexai and pgvector so that the most relevant results for question answering are returned directly, avoiding the need for post-processing? Any suggestions or examples would be greatly appreciated!

Upvotes: 0

Views: 111

Answers (1)

McMaco

Reputation: 178

If you want to use embeddings for document search, information retrieval, or Q&A use cases such as search, chatbots, or RAG, you need to run two embedding jobs with different task types:

  1. Use the RETRIEVAL_DOCUMENT task type to create optimized embeddings for your documents (also called a corpus).

  2. Use one of the following task types to create optimized embeddings for your queries, depending on the nature of the queries:

  • RETRIEVAL_QUERY: Use as the default task type for queries, such as "best restaurants in Vancouver", "green vegetables", or "What is the best cookie recipe?".

  • QUESTION_ANSWERING: Use in cases where all queries are formatted as proper questions, such as "Why is the sky blue?" or "How do I tie my shoelaces?".

  • FACT_VERIFICATION: Use in cases where you want to retrieve a document from your corpus that proves or disproves a statement. For example, the query "apples grow underground" might retrieve an article about apples that would ultimately disprove the statement.

Key Point: To get embeddings that you can use for information retrieval, embed your documents with the RETRIEVAL_DOCUMENT task type and embed your queries with one of the query task types above (RETRIEVAL_QUERY by default, or QUESTION_ANSWERING in your case).
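In langchain_google_vertexai, one way to wire this in end to end is to subclass VertexAIEmbeddings so that document and query embeddings use different task types. This is a minimal sketch, not an official API: it assumes a recent version of the package in which VertexAIEmbeddings.embed() accepts an embeddings_task_type argument:

from typing import List
from langchain_google_vertexai import VertexAIEmbeddings

class TaskTypeEmbeddings(VertexAIEmbeddings):
    """Hypothetical wrapper: separate task types for corpus and queries."""

    def embed_documents(self, texts: List[str]) -> List[List[float]]:
        # Corpus side: embeddings optimized for being retrieved.
        return self.embed(texts, embeddings_task_type="RETRIEVAL_DOCUMENT")

    def embed_query(self, text: str) -> List[float]:
        # Query side: every query here is a proper question.
        return self.embed([text], embeddings_task_type="QUESTION_ANSWERING")[0]

vertex_embeddings = TaskTypeEmbeddings(model_name="text-multilingual-embedding-002")

Since the vector store calls embed_query() on the incoming question at search time, a retriever built on top of this embeddings object ranks results with the QUESTION_ANSWERING task type directly, with no post-filtering.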

In addition, after you've generated your embeddings, you can add them to a vector database such as Vertex AI Vector Search. This enables low-latency retrieval, which becomes critical as your data grows.
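With pgvector, the wiring might look like this (a sketch assuming the langchain_community PGVector integration; the collection name and connection string are placeholders):

from langchain_community.vectorstores.pgvector import PGVector

# Index the corpus with RETRIEVAL_DOCUMENT embeddings; at query time,
# the retriever calls embed_query(), which uses QUESTION_ANSWERING.
vector_store = PGVector.from_documents(
    documents         = docs,  # your corpus as LangChain Documents
    embedding         = vertex_embeddings,
    collection_name   = "my_collection",                                 # placeholder
    connection_string = "postgresql+psycopg2://user:pass@host:5432/db",  # placeholder
)
retriever = vector_store.as_retriever(search_kwargs={"k": 1})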

Upvotes: 0
