6:[["$","$Le",null,{}],["$","div",null,{"className":"min-h-screen bg-gray-100 p-6","children":[["$","$Lf",null,{}],["$","script",null,{"type":"application/ld+json","dangerouslySetInnerHTML":{"__html":"{\"@context\":\"https://schema.org\",\"@type\":\"QAPage\",\"mainEntity\":{\"@type\":\"Question\",\"name\":\"How to speed up the embedding process in LangChain\",\"text\":\"

I have approximately 1600 short text files to embed using Sentence Transformers and store in a chroma vector in LangChain.

\\n

I want to create a Retrieval Question/Answering (QA) capability to retrieve those text files. I've done the processing and started the embedding process, but it's been 3-4 hours and it's still running. Is there any way to speed up this process?

\\n\",\"author\":{\"@type\":\"Person\",\"name\":\"mechanicalsloth\"},\"upvoteCount\":1,\"answerCount\":1,\"acceptedAnswer\":null}}"}}],["$","div",null,{"className":"bg-white shadow-md rounded-lg p-6 mb-6 relative","children":[["$","div",null,{"className":"absolute top-4 right-4 flex flex-wrap space-x-2","children":[["$","span","nlp",{"className":"bg-blue-600 text-white text-sm px-3 py-1 rounded-full","children":["$","$L10",null,{"href":"/discussion/tag/nlp/1","children":"nlp"}]}],["$","span","embedding",{"className":"bg-blue-600 text-white text-sm px-3 py-1 rounded-full","children":["$","$L10",null,{"href":"/discussion/tag/embedding/1","children":"embedding"}]}],["$","span","langchain",{"className":"bg-blue-600 text-white text-sm px-3 py-1 rounded-full","children":["$","$L10",null,{"href":"/discussion/tag/langchain/1","children":"langchain"}]}]]}],["$","div",null,{"className":"flex items-center mb-4","children":[["$","img",null,{"src":"https://www.gravatar.com/avatar/26e2c612343bc889cdc7bdde888e27fe?s=256&d=identicon&r=PG&f=y&so-version=2","alt":"mechanicalsloth","className":"w-16 h-16 rounded-full border"}],["$","div",null,{"className":"ml-4","children":[["$","a",null,{"href":"https://stackoverflow.com/users/21409749/mechanicalsloth","target":"_blank","rel":"noopener noreferrer","className":"text-lg font-semibold text-blue-600 hover:underline","children":"mechanicalsloth"}],["$","p",null,{"className":"text-sm text-gray-500","children":["Reputation: ",11]}]]}]]}],["$","h1",null,{"className":"text-2xl font-bold text-gray-800 mb-4","children":"How to speed up the embedding process in LangChain"}],["$","p",null,{"className":"text-gray-700 mt-4","dangerouslySetInnerHTML":{"__html":"

I have approximately 1600 short text files to embed using Sentence Transformers and store in a chroma vector in LangChain.

\n"}}],["$","div",null,{"className":"text-gray-600 text-sm mt-4","children":[["$","p",null,{"children":["Upvotes: ",1]}],["$","p",null,{"children":["Views: ",2141]}]]}]]}],["$","div",null,{"className":"container mx-auto","children":[["$","h2",null,{"className":"text-2xl font-semibold text-gray-800 mb-6","children":["Answers (",1,")"]}],[["$","div","76534841",{"className":"bg-white shadow-md rounded-lg p-6 mb-6","children":[["$","div",null,{"className":"flex items-center mb-4","children":[["$","img",null,{"src":"https://i.sstatic.net/WkrdO.png?s=256","alt":"tomasantunes","className":"w-12 h-12 rounded-full border"}],["$","div",null,{"className":"ml-4","children":[["$","a",null,{"href":"https://stackoverflow.com/users/4102203/tomasantunes","target":"_blank","rel":"noopener noreferrer","className":"text-lg font-semibold text-blue-600 hover:underline","children":"tomasantunes"}],["$","p",null,{"className":"text-sm text-gray-500","children":["Reputation: ",945]}]]}]]}],["$","p",null,{"className":"text-gray-700 mb-4","dangerouslySetInnerHTML":{"__html":"

You should try a smaller sample and do the math to see how long it really takes. It's a computationally intensive task so if you want to speed it up you have to rent compute time with a faster GPU.

It helps to do these tasks on a pre-processing stage and then keep a memory cache for fast delivery.

\n"}}],["$","div",null,{"className":"text-gray-600 text-sm","children":["$","p",null,{"children":["Upvotes: ",0]}]}]]}]]]}],["$","div",null,{"className":"bg-white shadow-md rounded-lg p-6 mt-6","children":[["$","h2",null,{"className":"text-2xl font-semibold text-gray-800 mb-4","children":"Related Questions"}],["$","ul",null,{"className":"list-disc list-inside","children":[["$","li","76232375",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/76232375","className":"text-blue-600 hover:underline","children":"LangChain Chroma - load data from Vector Database"}]}],["$","li","77984922",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/77984922","className":"text-blue-600 hover:underline","children":"How can I get the embedding of a document in langchain?"}]}],["$","li","76927410",{"className":"mb-2","children":["$","$L10",null,{"href":"/discussion/solution/76927410","className":"text-blue-600 hover:underline","children":"Chatbot using csv file"}]}]]}]]}]]}],["$","$L11",null,{}],["$","$L12",null,{}],["$","$L13",null,{}],["$","$L14",null,{}],["$","$L15",null,{}]]

How to speed up the embedding process in LangChain

Answers (1)

Related Questions