Neo4j
Send processed data from Unstructured to Neo4j.
The requirements are as follows.
- A Neo4j deployment.
  - For the Unstructured Platform, local Neo4j deployments are not supported.
  - For Unstructured Ingest, local and non-local Neo4j deployments are supported.
The following video shows how to set up a Neo4j Aura deployment:
- The username and password for the user who has access to the Neo4j deployment. The default user is typically `neo4j`.
  - For a Neo4j Aura instance, the default user’s password is typically set when the instance is created.
  - For an AWS Marketplace, Microsoft Azure Marketplace, or Google Cloud Marketplace deployment of Neo4j, the default user is typically set during the deployment process.
  - For a local Neo4j deployment, you can set the default user’s initial password or recover an admin user and its password.
- The connection URI for the Neo4j deployment, which starts with `neo4j://`, `neo4j+s://`, `bolt://`, or `bolt+s://`; followed by `localhost` or the host name; and sometimes ending with a colon and the port number (such as `:7687`). For example:
  - For a Neo4j Aura deployment, browse to the target Neo4j instance in the Neo4j Aura account and click Connect > Drivers to get the connection URI, which follows the format `neo4j+s://<host-name>`. A port number is not used or needed.
  - For an AWS Marketplace, Microsoft Azure Marketplace, or Google Cloud Marketplace deployment of Neo4j, see Neo4j on AWS, Neo4j on Azure, or Neo4j on GCP for details about how to get the connection URI.
  - For a local Neo4j deployment, the URI is typically `bolt://localhost:7687`.
  - For other Neo4j deployment types, see the deployment provider’s documentation.
- The name of the target database in the Neo4j deployment. A default Neo4j deployment typically contains two standard databases: one named `neo4j` for user data and another named `system` for system data and metadata. Some Neo4j deployment types support more than these two databases per deployment; Neo4j Aura instances do not.
  - Create additional databases for a local Neo4j deployment that uses Enterprise Edition; or for Neo4j on AWS, Neo4j on Azure, or Neo4j on GCP deployments.
  - Get a list of additional available databases for a local Neo4j deployment that uses Enterprise Edition; or for Neo4j on AWS, Neo4j on Azure, or Neo4j on GCP deployments.
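The connection URI shape described above can be checked before it is entered into a connector. The following is a minimal sketch using only the Python standard library; the function name `check_neo4j_uri` is hypothetical and is not part of Unstructured or Neo4j.

```python
from typing import Optional, Tuple
from urllib.parse import urlsplit

# Schemes named in the requirements above.
ALLOWED_SCHEMES = {"neo4j", "neo4j+s", "bolt", "bolt+s"}


def check_neo4j_uri(uri: str) -> Tuple[str, str, Optional[int]]:
    """Return (scheme, host, port), or raise ValueError for an unexpected shape.

    Hypothetical helper for illustration only.
    """
    parts = urlsplit(uri)
    if parts.scheme not in ALLOWED_SCHEMES:
        raise ValueError(f"unexpected scheme: {parts.scheme!r}")
    if not parts.hostname:
        raise ValueError("missing host name")
    # parts.port is None when the URI has no ":<port>" suffix,
    # which is the expected case for a Neo4j Aura URI.
    return parts.scheme, parts.hostname, parts.port
```

For example, `check_neo4j_uri("bolt://localhost:7687")` yields the scheme, host, and port of a typical local deployment, while an `http://` URI raises `ValueError`.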
Graph Output
The graph output of the Neo4j destination connector is represented in the following diagram:
In the preceding diagram:
- The `Document` node represents the source file.
- The `UnstructuredElement` nodes represent the source file’s `UnstructuredElement` objects, before chunking.
- The `Chunk` nodes represent the source file’s `UnstructuredElement` objects, after chunking.
- Each `UnstructuredElement` node has a `PART_OF_DOCUMENT` relationship with the `Document` node.
- Each `Chunk` node also has a `PART_OF_DOCUMENT` relationship with the `Document` node.
- Each `UnstructuredElement` node has a `PART_OF_CHUNK` relationship with a `Chunk` node.
- Each `Chunk` node, except for the “last” `Chunk` node, has a `NEXT_CHUNK` relationship with its “next” `Chunk` node.
Learn more about document elements and chunking.
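To make the relationship rules above concrete, the following minimal sketch derives the full set of relationships for one document from its chunks and elements. It is an illustration only, not the connector's implementation: the function name, the use of plain string ids, and the relationship direction (source to target) are all assumptions.

```python
from typing import Dict, List, Tuple

# (source node id, relationship type, target node id)
Relationship = Tuple[str, str, str]


def build_relationships(
    document_id: str, chunks: Dict[str, List[str]]
) -> List[Relationship]:
    """Derive the relationships described above for a single document.

    `chunks` maps each chunk id, in document order, to the ids of the
    elements it contains. The direction of each relationship is assumed.
    """
    rels: List[Relationship] = []
    chunk_ids = list(chunks)
    for chunk_id, element_ids in chunks.items():
        rels.append((chunk_id, "PART_OF_DOCUMENT", document_id))
        for element_id in element_ids:
            rels.append((element_id, "PART_OF_DOCUMENT", document_id))
            rels.append((element_id, "PART_OF_CHUNK", chunk_id))
    # Every chunk except the last is linked to its successor.
    for current, following in zip(chunk_ids, chunk_ids[1:]):
        rels.append((current, "NEXT_CHUNK", following))
    return rels
```

With two chunks holding three elements in total, this produces five `PART_OF_DOCUMENT`, three `PART_OF_CHUNK`, and one `NEXT_CHUNK` relationship.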
Some related example Neo4j graph queries include the following.
Query for all nodes:
Query for `Chunk` to `Document` relationships:
Query for `UnstructuredElement` to `Document` relationships:
Query for `UnstructuredElement` to `Chunk` relationships:
Query for `Chunk` to `Chunk` relationships:
Query for `UnstructuredElement` to `Chunk` to `Document` relationships:
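The queries described so far might be written as follows. This is a sketch under stated assumptions, not the page's original queries: the Cypher text is held in a hypothetical Python dictionary named `QUERIES`, and the relationship patterns are written without arrowheads so that they match regardless of relationship direction.

```python
# Hypothetical Cypher text for the queries described above, keyed by purpose.
# Patterns are undirected (no arrowheads) because the direction of each
# relationship is not stated on this page.
QUERIES = {
    "all_nodes": "MATCH (n) RETURN n",
    "chunk_to_document": (
        "MATCH (c:Chunk)-[r:PART_OF_DOCUMENT]-(d:Document) RETURN c, r, d"
    ),
    "element_to_document": (
        "MATCH (u:UnstructuredElement)-[r:PART_OF_DOCUMENT]-(d:Document) "
        "RETURN u, r, d"
    ),
    "element_to_chunk": (
        "MATCH (u:UnstructuredElement)-[r:PART_OF_CHUNK]-(c:Chunk) RETURN u, r, c"
    ),
    "chunk_to_chunk": (
        "MATCH (c1:Chunk)-[r:NEXT_CHUNK]-(c2:Chunk) RETURN c1, r, c2"
    ),
    "element_to_chunk_to_document": (
        "MATCH (u:UnstructuredElement)-[r1:PART_OF_CHUNK]-(c:Chunk)"
        "-[r2:PART_OF_DOCUMENT]-(d:Document) RETURN u, r1, c, r2, d"
    ),
}
```

Each string can be pasted into a Cypher console, or passed to `session.run(...)` when using the official `neo4j` Python driver.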
Query for `UnstructuredElement` nodes containing the text `jury`, and show their `Chunk` relationships:
Query for the `Chunk` with the specified `id`, and show its `UnstructuredElement` relationships:
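The two filtered queries above take a value (a search term or an id), so one reasonable approach is to pass that value as a Cypher parameter rather than splicing it into the query text. The sketch below assumes that element text is stored in a property named `text`; the helper names are hypothetical, and the patterns are undirected because the relationship direction is not stated here.

```python
from typing import Any, Dict, Tuple

ParameterizedQuery = Tuple[str, Dict[str, Any]]


def elements_containing(text: str) -> ParameterizedQuery:
    """Query for elements whose (assumed) `text` property contains `text`."""
    query = (
        "MATCH (u:UnstructuredElement)-[r:PART_OF_CHUNK]-(c:Chunk) "
        "WHERE u.text CONTAINS $text "
        "RETURN u, r, c"
    )
    return query, {"text": text}


def chunk_by_id(chunk_id: str) -> ParameterizedQuery:
    """Query for one chunk by its `id` property, with its elements."""
    query = (
        "MATCH (c:Chunk {id: $id})-[r:PART_OF_CHUNK]-(u:UnstructuredElement) "
        "RETURN c, r, u"
    )
    return query, {"id": chunk_id}
```

For example, `elements_containing("jury")` returns the query text together with `{"text": "jury"}`, which can be supplied as the query and its parameters to `session.run(...)` in the official `neo4j` Python driver.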
To create the destination connector:
- On the sidebar, click Connectors.
- Click Destinations.
- Click New or Create Connector.
- Give the connector a unique Name.
- In the Provider area, click Neo4j.
- Click Continue.
- Follow the on-screen instructions to fill in the fields as described later on this page.
- Click Save and Test.
Fill in the following fields:
- Name (required): A unique name for this connector.
- URI (required): The connection URI for the Neo4j deployment, which typically starts with `neo4j://`, `neo4j+s://`, `bolt://`, or `bolt+s://`; is followed by the host name; and ends with a colon and the port number (such as `:7473`, `:7474`, or `:7687`).
- Database (required): The name of the target database in the Neo4j deployment. A default Neo4j deployment typically contains a standard database named `neo4j` for user data.
- Username (required): The name of the user who has access to the Neo4j deployment. A default Neo4j deployment typically contains a default user named `neo4j`.
- Password (required): The password for the user.
- Batch Size: The maximum number of nodes or relationships to be transmitted per batch. The default is `100` if not otherwise specified.
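The fields above, and the effect of Batch Size, can be sketched as follows. The class `Neo4jDestinationSettings` and the function `batches` are hypothetical names used only for illustration; they do not correspond to an Unstructured API, and the connector's actual batching logic may differ.

```python
from dataclasses import dataclass
from typing import Iterator, List, Sequence


@dataclass
class Neo4jDestinationSettings:
    """Hypothetical container mirroring the fields listed above."""

    name: str
    uri: str
    database: str
    username: str
    password: str
    batch_size: int = 100  # default stated in the field description

    def __post_init__(self) -> None:
        if self.batch_size < 1:
            raise ValueError("batch_size must be at least 1")


def batches(items: Sequence[str], batch_size: int) -> Iterator[List[str]]:
    """Yield consecutive slices holding at most `batch_size` items each."""
    for start in range(0, len(items), batch_size):
        yield list(items[start:start + batch_size])
```

With the default batch size, 250 nodes would be sent as three batches of 100, 100, and 50 under this sketch.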