cuneyttyler

Reputation: 1355

How to use Google Vertex AI fine tuned model via Node.js

I fine-tuned a model on Google Vertex AI. Before that, I was using regular models with this code (it works):

public static async SendMessage(prompt) {
    const vertexAI = new VertexAI({project: GOOGLE_PROJECT_ID, location: 'us-central1', googleAuthOptions: {keyFile: KEY_FILE_PATH}});

    const generativeModel = vertexAI.getGenerativeModel({
      model: 'gemini-1.0-pro',
    });

    try {
      const resp = await generativeModel.generateContent(prompt);
      const contentResponse = await resp.response;
      if(!contentResponse || !contentResponse.candidates || contentResponse.candidates.length == 0 || !contentResponse.candidates[0].content 
      || !contentResponse.candidates[0].content.parts || contentResponse.candidates[0].content.parts.length == 0) {
        throw Error("ERROR: NO RESPONSE RETURNED FROM GOOGLE GENAI")
      } else {
        return contentResponse.candidates[0].content.parts[0].text;
      }
    } catch(e) {
      console.error(e)
      return "Let's talk about this later."
    }
  }

Now, I'm trying to access my fine-tuned model with this code:

public static async SendMessage(prompt) {
    const ENDPOINT_URL = `https://us-central1-aiplatform.googleapis.com/v1/projects/${GOOGLE_PROJECT_ID}/locations/us-central1/endpoints/MY_ENDPOINT:predict`;

    const vertexAI = new VertexAI({project: GOOGLE_PROJECT_ID, location: 'us-central1', apiEndpoint: ENDPOINT_URL, googleAuthOptions: {keyFile: KEY_FILE_PATH}});

    const generativeModel = vertexAI.getGenerativeModel({
      model: 'FINE_TUNED_MODEL_NAME',
    });

    try {
      const resp = await generativeModel.generateContent(prompt);
      const contentResponse = await resp.response;
      if(!contentResponse || !contentResponse.candidates || contentResponse.candidates.length == 0 || !contentResponse.candidates[0].content 
      || !contentResponse.candidates[0].content.parts || contentResponse.candidates[0].content.parts.length == 0) {
        throw Error("ERROR: NO RESPONSE RETURNED FROM GOOGLE GENAI")
      } else {
        return contentResponse.candidates[0].content.parts[0].text;
      }
    } catch(e) {
      console.error(e)
      return "Let's talk about this later."
    }
  }

I created MY_ENDPOINT on the Google Cloud console and deployed the model, but I'm not sure how to access it from a client (Node.js in this case). It throws this error:

[2024-07-12T06:09:39.356Z] GoogleGenerativeAIError: [VertexAI.GoogleGenerativeAIError]: exception posting request to model
    at D:\Dev\Anima\Client\node_modules\@google-cloud\vertexai\build\src\functions\generate_content.js:49:15
    at process.processTicksAndRejections (d:\Dev\Anima\Client\lib\internal\process\task_queues.js:95:5)
    at async generateContent (D:\Dev\Anima\Client\node_modules\@google-cloud\vertexai\build\src\functions\generate_content.js:39:22)
    at async GoogleGenAI.SendMessage (d:\Dev\Anima\Client\Anima\GoogleGenAI.ts:20:20)
    at async GoogleGenAIController.Send (file:///D:/Dev/Anima/Client/jsbuild/Anima/GenAIController.js:30:24) {stackTrace: TypeError: fetch failed
    at node:internal…undici:12502:13
    at process.processTick…, name: 'GoogleGenerativeAIError', stack: 'GoogleGenerativeAIError: [VertexAI.GoogleGene…lient/jsbuild/Anima/GenAIController.js:30:24)', message: '[VertexAI.GoogleGenerativeAIError]: exception posting request to model'}

EDIT: I realized that the vertexAI.getGenerativeModel method adds a "publishers/google/models/..." prefix to the model name before sending if the model name doesn't start with "models/", so I need to pass the full model path. But I'm not sure what it is. I'm also not sure if the ENDPOINT is correct.

Upvotes: 0

Views: 710

Answers (1)

cuneyttyler

Reputation: 1355

It's about how you pass the endpoint to the Vertex AI Node.js SDK. We need to provide it as the model parameter, in this form: projects/PROJECT_ID/locations/us-central1/endpoints/ENDPOINT_ID

It's because of how the Vertex AI Node.js SDK handles the parameters you pass. In post_request.js, this is how the final endpoint is generated: let vertexEndpoint = `https://${vertexBaseEndpoint}/${apiVersion}/${resourcePath}:${resourceMethod}`

Now, resourcePath is generated from the model parameter we pass to the .getGenerativeModel method. We usually pass a model name such as gemini-1.0-pro; in that case the SDK appends a prefix to it to build resourcePath, which is the full path to Google's published model. When we want to use a fine-tuned model deployed to an endpoint in our own Google Cloud project, we need to pass the endpoint (not the model) as the model parameter, as described above. In that case the Vertex AI Node.js SDK sees that the model parameter starts with 'projects/' and doesn't append any prefix (see the sketch below).
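For illustration, here is a minimal sketch of that behavior, assuming the logic described above; the helper name formulateResourcePath is made up here, and this is not the SDK's actual source:

// A minimal sketch of the resource-path logic described above. This is an
// illustration based on the observed behavior, not the SDK's actual source;
// the helper name formulateResourcePath is hypothetical.
function formulateResourcePath(model: string, project: string, location: string): string {
  if (model.startsWith('projects/')) {
    // A full resource path (e.g. a tuned-model endpoint) is used as-is.
    return model;
  }
  // A plain model name gets the publisher-model prefix.
  return `projects/${project}/locations/${location}/publishers/google/models/${model}`;
}

const project = 'PROJECT_ID';
const location = 'us-central1';
const base = `https://${location}-aiplatform.googleapis.com/v1`;

// 'gemini-1.0-pro' resolves to the publisher model:
//   .../publishers/google/models/gemini-1.0-pro:generateContent
console.log(`${base}/${formulateResourcePath('gemini-1.0-pro', project, location)}:generateContent`);

// The endpoint resource path is kept as-is:
//   .../endpoints/ENDPOINT_ID:generateContent
const endpointPath = `projects/${project}/locations/${location}/endpoints/ENDPOINT_ID`;
console.log(`${base}/${formulateResourcePath(endpointPath, project, location)}:generateContent`);

That's why passing the full projects/.../endpoints/ENDPOINT_ID path as the model (and no apiEndpoint override) is enough for the SDK to reach the deployed fine-tuned model.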

Code:

const vertexAI = new VertexAI({project: GOOGLE_PROJECT_ID, location: 'us-central1', googleAuthOptions: {keyFile: KEY_FILE_PATH}});

// Pass the full endpoint resource path as the model name so the SDK
// doesn't prepend the publisher-model prefix.
const generativeModel = vertexAI.getGenerativeModel({
  model: 'projects/PROJECT_ID/locations/us-central1/endpoints/ENDPOINT_ID',
});

try {
  const resp = await generativeModel.generateContent(prompt);
  const contentResponse = await resp.response;
  if(!contentResponse || !contentResponse.candidates || contentResponse.candidates.length == 0 || !contentResponse.candidates[0].content
      || !contentResponse.candidates[0].content.parts || contentResponse.candidates[0].content.parts.length == 0) {
    throw Error("ERROR: NO RESPONSE RETURNED FROM GOOGLE GENAI")
  } else {
    return contentResponse.candidates[0].content.parts[0].text;
  }
} catch(e) {
  console.error(e)
  throw Error(e)
}

Upvotes: 0
