GCP Dataplex Universal Catalog is Google’s managed metadata and data governance service. Neocarta provides two purpose-scoped connectors for it:Documentation Index
Fetch the complete documentation index at: https://mintlify.com/neo4j-labs/neocarta/llms.txt
Use this file to discover all available pages before exploring further.
DataplexSchemaConnector reads BigQuery structural metadata (tables and columns) from the Dataplex catalog, and DataplexGlossaryConnector reads business glossary terms and links them to the schema entities via TAGGED_WITH edges.
DataplexSchemaConnector
Extracts BigQuery catalog metadata via the DataplexCatalogServiceClient and maps it to Database, Schema, Table, and Column nodes.
Dataplex catalog metadata is less comprehensive than reading BigQuery Information Schema directly. Primary and foreign keys are not available via the Dataplex API, so columns are loaded without
is_primary_key / is_foreign_key flags and no REFERENCES edges are produced. Use BigQuerySchemaConnector if you need foreign key relationships.Databasenode for the GCP projectSchemanodes for each datasetTablenodes with descriptionsColumnnodes with types, nullability, and descriptions
Import
Parameters
An authenticated Dataplex Catalog client.
GCP project ID (e.g.
"my-project").GCP project number (e.g.
"123456789012"). Required for Dataplex entry path construction.Dataplex location, e.g.
"us" or "us-central1".Connected Neo4j driver instance.
Target Neo4j database name.
ingest() Parameters
The BigQuery dataset ID to extract.
Code Example
CLI
Required Environment Variables
| Variable | Purpose |
|---|---|
NEO4J_URI | Neo4j connection URI |
NEO4J_USERNAME | Neo4j username |
NEO4J_PASSWORD | Neo4j password |
NEO4J_DATABASE | Target Neo4j database (default: neo4j) |
GCP_PROJECT_ID | GCP project ID |
GCP_PROJECT_NUMBER | GCP project number |
DATAPLEX_LOCATION | Dataplex location (e.g. us, us-central1) |
BIGQUERY_DATASET_ID | BigQuery dataset ID |
DataplexGlossaryConnector
Reads Dataplex business glossary content and catalog-to-glossary entry links, producing Glossary, Category, and BusinessTerm nodes along withTAGGED_WITH edges that connect them to the schema entities ingested by DataplexSchemaConnector.
What it ingests:
Glossarynodes with name, description, and resource pathCategorynodes within each glossaryBusinessTermnodes within each category(:Glossary)-[:HAS_CATEGORY]->(:Category)edges(:Category)-[:HAS_BUSINESS_TERM]->(:BusinessTerm)edges(:Column)-[:TAGGED_WITH]->(:BusinessTerm)edges (wheninclude_entry_links=True)(:Table)-[:TAGGED_WITH]->(:BusinessTerm)edges (wheninclude_entry_links=True)
Import
Parameters
An authenticated Dataplex Business Glossary client.
GCP project ID.
GCP project number.
Dataplex location, e.g.
"us" or "us-central1".Connected Neo4j driver instance.
Target Neo4j database name.
ingest() Parameters
Whether to also ingest catalog↔glossary entry links (
TAGGED_WITH edges). Set to False if the schema catalog is not loaded in this Neo4j instance, or to skip the REST API round-trips when you only want the glossary content itself.Code Example
CLI
Required Environment Variables
| Variable | Purpose |
|---|---|
NEO4J_URI | Neo4j connection URI |
NEO4J_USERNAME | Neo4j username |
NEO4J_PASSWORD | Neo4j password |
NEO4J_DATABASE | Target Neo4j database (default: neo4j) |
GCP_PROJECT_ID | GCP project ID |
GCP_PROJECT_NUMBER | GCP project number |
DATAPLEX_LOCATION | Dataplex location (e.g. us, us-central1) |