Langflow

Langflow is a visual framework for building multi-agent and RAG applications. It is open-source, fully customizable, and works with most LLMs and many vector stores out of the box. Langflow designer

This no-code, hands-on demonstration walks you through creating a Langflow project that enables you to use GPT-4o-mini to chat in real time with a PDF document that is processed by Unstructured and has its processed data stored in an Astra DB vector database.

Prerequisites

An Astra account. Create or sign in to an Astra account.
A database in the Astra account. Create a database in an account.
An application token for the database. Create a database application token.
A namespace in the database. Create a namespace in a database.
A collection in the namespace. Create a collection in a namespace. An existing collection is not required. At runtime, the collection behavior is as follows: For the Unstructured UI and Unstructured API:
- If an existing collection name is specified, and Unstructured generates embeddings, but the number of dimensions that are generated does not match the existing collection’s embedding settings, the run will fail. You must change your Unstructured embedding settings or your existing collection’s embedding settings to match, and try the run again.
- If a collection name is not specified, Unstructured creates a new collection in your namespace. If Unstructured generates embeddings, the new collections’s name will be u<short-workflow-id>_<short-embedding-model-name>_<number-of-dimensions>. If Unstructured does not generate embeddings, the new collections’s name will be u<short-workflow-id.
For Unstructured Ingest:
- If an existing collection name is specified, and Unstructured generates embeddings, but the number of dimensions that are generated does not match the existing collection’s embedding settings, the run will fail. You must change your Unstructured embedding settings or your existing collections’s embedding settings to match, and try the run again.
- If a collection name is not specified, Unstructured creates a new collection in your Pinecone account. The new collection’s name will be unstructuredautocreated.

Also:

Sign up for an OpenAI account, and get your OpenAI API key.
Sign up for a free Langflow account.
Get your Unstructured account and Unstructured API key:
1. Sign in to your Unstructured account:
  - If you do not already have an Unstructured account, go to https://unstructured.io/contact and fill out the online form to indicate your interest.
  - If you already have an Unstructured account, sign in by using the URL of the sign in page that Unstructured provided to you when your Unstructured account was created. After you sign in, the Unstructured user interface (UI) then appears, and you can start using it right away. If you do not have this URL, contact Unstructured Sales at sales@unstructured.io.
2. Get your Unstructured API key: a. In the Unstructured UI, click API Keys on the sidebar.
  b. Click Generate API Key.
  c. Follow the on-screen instructions to finish generating the key.
  d. Click the Copy icon next to your new key to add the key to your system’s clipboard. If you lose this key, simply return and click the Copy icon again.

Create and run the demonstration project

Create the Langflow project

Sign in to your Langflow dashboard.
From your dashboard, click New Project.
Click Blank Flow.

Add the Unstructured component

In this step, you add a component that instructs the Unstructured Partition Endpoint to process a local file that you specify.

On the sidebar, expand Experimental (Beta), and then expand Loaders.
Drag the Unstructured component onto the designer area.
In the Unstructured component, click the box or icon next to File, and then select a local file for Unstructured to process. This component works only with the file extensions .pdf, .docx, and .txt. Although you can use any local file with one of these extensions, this demonstration uses the text of the United States Constitution in PDF format, saved to your local development machine.
To work with multiple local files, or to work with remote files, see the suggested options in Next steps.
For Unstructured.io Serverless API Key, enter your Unstructured API key value.
Wait until Saved appears in the top navigation bar.

Add the OpenAI Embeddings component

In this step, you add a component that generates vector embeddings for the processed data that Unstructured outputs.

On the sidebar, expand Embeddings, and then drag the OpenAI Embeddings component onto the designer area.
In the OpenAI Embeddings component, for Model, select text-embedding-3-large.
For OpenAI API Key, enter your OpenAI API key’s value.
Wait until Saved appears in the top navigation bar.

Add the Astra DB components

In this step, you add two components. The first component instructs Astra DB to ingest into the specified Astra DB collection the processed data that Unstructured outputs along with the associated generated vector embeddings. The second component instructs Astra DB to take user-supplied chat messages and perform contextual searches over the ingested data in the specified Astra DB collection, outputting its search results.

On the sidebar, expand Vector Stores, and then drag the Astra DB component onto the designer area.
Double-click the Astra DB component’s title bar, and rename the component to Astra DB Ingest.
Repeat these previous two actions to add a second Astra DB component, renaming it to Astra DB RAG.
In both of these Astra DB components, in the Database list, select the name of your Astra DB database. Make sure this is the same database name in both components.
In the Collection list in both components, select the name of the collection in the database. Make sure this is the same collection name in both components.
In the Astra DB Application Token box in both components, enter your Astra DB application token’s value. Make sure this is the same application token value in both components.
Connect the Data output from the Unstructured component to the Ingest Data input in the Astra DB Ingest component. To make the connection, click and hold your mouse pointer inside of the circle next to Data in the Unstructured component. While holding your mouse pointer, drag it over into the circle next to Ingest Data in the Astra DB Ingest component. Then release your mouse pointer. A line appears between these two circles.
Connect the Embeddings output from the OpenAI Embeddings component to the Embedding or Astra Vectorize input in the Astra DB Ingest component.
Wait until Saved appears in the top navigation bar.
In the title bar of the Astra DB Ingest component, click the play icon. This ingests the processed data from Unstructured and the associated generated vector embeddings into the specified Astra DB collection.
Wait until Building disppears from the top navigation bar, and a green check mark appears next to this play icon. This could take several minutes.
Each time you click the play icon in the Astra DB Ingest component, Unstructured reprocesses the specified local file. If this file does not change, this could result in multiple duplicate records being inserted into the specified Astra DB collection. You should only click the play icon in the Astra DB Ingest component when you want to insert new processed data into the specified Astra DB collection.

Add the Chat Input component

In this step, you add a component that takes user-supplied chat messages and sends them as input to Astra DB for contextual searching.

On the sidebar, expand Inputs, and then drag the Chat Input component onto the designer area.
Connect the Message output from the Chat Input component to the Search Input input in the Astra DB RAG component.
Wait until Saved appears in the top navigation bar.

Add the Parse Data component

In this step, you add a component that takes the Astra DB search results and converts them into plain text, suitable for inclusion in a prompt to a text-based LLM.

On the sidebar, expand Helpers, and then drag the Parse Data component onto the designer area.
Connect the Search Results output from the Astra DB RAG component to the Data input in the Parse Data component.
Wait until Saved appears in the top navigation bar.

Add the Prompt component

In this step, you add a component that builds a prompt and then sends it to a text-based LLM.

On the sidebar, expand Prompts, and then drag the Prompt component onto the designer area.
In the Prompt component, next to Template, click the box or arrow icon.

In the Edit Prompt window, enter the following prompt:

{context}

---

Given the context above, answer the question as best as possible.

Question: {question}

Answer:

To answer the question, the preceding prompt uses the context along with general information that the text-based LLM is trained on. To use only the context to answer the question, you can change the prompt, for example to something like this:

{context}

---

Given the context above, answer the question as best as possible. Use only the context to answer the question. Do not use 
any other sources of information. If the context does not provide enough information to answer the question, reply with 
'I do not have enough context to answer the question.'

Question: {question}

Answer:

Click Check & Save.
Connect the Text output from the Parse Data component to the context input in the Prompt component.
Connect the Message output from the Chat Input component to the question input in the Prompt component.
You will now have two connections from the Message output in the Chat Input component:
- One connection was already made to the Search Input input in the Astra DB RAG component.
- Another connection has just now been made to the question input in the Prompt component.
Wait until Saved appears in the top navigation bar.

Add the OpenAI component

In this step, you create a component that sends a prompt to a text-based LLM and outputs the LLM’s response.

On the sidebar, expand Models, and then drag the OpenAI component onto the designer area.
In the Model Name list, select gpt-4o-mini.
For OpenAI API Key, enter your OpenAI API key’s value.
For Temperature, enter 0.1.
Connect the Prompt Message output from the Prompt component to the Input input in the OpenAI component.
Wait until Saved appears in the top navigation bar.

Add the Chat Output component

In this step, you create a component that returns the answer to the user’s original chat message.

On the sidebar, expand Outputs, and then drag the Chat Output component onto the designer area.
Connect the Text output from the OpenAI component to the Text input in the Chat Output component.
Wait until Saved appears in the top navigation bar.

The final project should look like this: Final project results

Run the project

In the designer area, click Playground.
Enter a question into the chat box, for example, What rights does the fifth amendment guarantee? Then press the send button.
Wait until the answer appears.
Ask as many additional questions as you want to.

Next steps

Now that you have your pipeline set up, here are just a few ways that you could modify it to support different requirements, such as processing multiple files or using a different vector store.

Process multiple files

In this demonstration, you pass to Unstructured a single local file. To pass multiple local or non-local files to Unstructured instead, you can use the Unstructured UI or the Unstructured API or Unstructured Ingest outside of Langflow. To do this, you can:

Use the Unstructured UI to create a workflow that relies on any available source connector to connect to Astra DB. Run this workflow outside of Langflow anytime you have new documents in that source location that you want Unstructured to process and then insert the new processed data into Astra DB. Then, back in the Langflow project, use the Playground to ask additional questions, which will now include the new data when generating answers.
Use Unstructured Ingest to create a pipeline that relies on any available source connector to connect to Astra DB. Run this pipeline outside of Langflow anytime you have new documents in that non-local source location that you want Unstructured to process and then insert the new processed data into Astra DB. Then, back in the Langflow project, use the Playground to ask additonal questions, which will now include the new data when generating answers.

Use a different vector store

In this demonstration, you use Astra DB as the vector store. Langflow and Unstructured support several vector stores in addition to Astra DB. To do this, you can: Use the Unstructured UI to create a workflow that relies on any available source connector to connect to one of the following available vector stores that Langflow also supports:

Run this workflow outside of Langflow anytime you have new documents in the source location that you want Unstructured to process and then insert the new processed data into the vector store. Then, back in the Langflow project, swap out the Astra DB RAG component for the corresponding Vector Stores component that matches the new vector store’s name. Configure the new component, and then use the Playground to ask additional questions, which will now use the new vector store when generating answers. Or, use Unstructured Ingest to create a pipeline that relies on any available source connector to connect to one of the following available vector stores that Langflow also supports:

Run this pipeline outside of Langflow anytime you have new documents in the source location that you want Unstructured to process and then insert the new processed data into the vector store. Then, back in the Langflow project, swap out the Astra DB RAG component for the corresponding Vector Stores component that matches the new vector store’s name. Configure the new component, and then use the Playground to ask additional questions, which will now use the new vector store when generating answers.

Notebooks

Code samples

Tool demos

Prerequisites

Create and run the demonstration project

Next steps

Process multiple files

Use a different vector store

Learn more

Notebooks

Code samples

Tool demos

​Prerequisites

​Create and run the demonstration project

​Next steps

​Process multiple files

​Use a different vector store

​Learn more

Prerequisites

Create and run the demonstration project

Next steps

Process multiple files

Use a different vector store

Learn more