CastorDoc and Snowflake

Snowflake is a cloud-built data warehouse that delivers elasticity and data sharing across multiple clouds.

integration mockup

Why CastorDoc x Snowflake makes sense?

Snowflake is a cloud-based data platform that provides a data warehouse-as-a-service designed for high-performance analytics and large-scale data workloads. Similar to Vertica, finding the most relevant data assets in Snowflake quickly and efficiently can be a challenge, and understanding the lineage between data warehouse tables and other assets like reports or dashboards is not straightforward. CastorDoc, designed to optimize the search for relevant data assets thanks to popularity and advanced filtering options, and providing lineage between data warehouse tables and other assets, can address these challenges. Therefore, integrating CastorDoc with Snowflake makes sense as it will enhance the user experience by making it easier to find relevant data assets quickly, understand their lineage, and ultimately trust and have visibility in the data. This will enable businesses to make better decisions faster, which is critical in today's fast-paced world.

How does CastorDoc x Snowflake integration work?

Similar to the Vertica integration, CastorDoc ingests metadata from Snowflake. This metadata includes table & column names and descriptions, frequently run queries, frequent users of data assets, data lineage links, data quality tests, last data table update, technical and business tags, and more. CastorDoc organises this metadata in an intuitive to use interface for both technical and business users. The ingestion process takes about 30 minutes to set up, and the metadata is available in CastorDoc the next day. It is important to note that CastorDoc does not access the data itself, only metadata. This ensures that your data stays safe and secure while CastorDoc delivers as much value as possible.

API Access: if any metadata element is not available in CastorDoc's native integration, you can ingest it with our comprehensive API.

Important:  CastorDoc do not access the data itself, only metadata. This ensure that you data stays safe & secure while CastorDoc delivers as much value as possible.

What does CastorDoc help you with?

Castor enables you to scale your self-service analytics strategy without losing control. We are designed with real use-cases in mind :

🔎   You work with data you don't know

Your boss asks to build a report on "Churn for Premium Users in 2021". You need to find the relevant dataset, understand the meaning of its column, and use it fast.

✅ Reduce by 95% the time to find the right data asset (source : Lyft)

🧬   A key employee is leaving

Mike, the data engineer that built the entire data infrastructure is leaving at the end of the month. All the knowledge is in his head. He needs to write it down.

✅ 42% of the work not recovered without knowledge management (source : 360Learning)

👩🏽‍🌾  A new employee onboarding

Elsa, data analyst, arrived last week. She has no idea what data the company stores or how it is used. She spends hours asking around to gather knowledge.

✅ New hires are autonomous after day 1

💣  A data pipeline is late

Nelson, customer success analyst, refreshes the "daily active users" dashboard every two minutes. The data hasn't arrived yet. He wants to know what is happening.

✅ 5x less Slack messages on #ask_data

🗺️  No one knows where personal information are

Camila, from data governance, has to map all personal information to comply to GDPR requirements. She needs a list of all data assets and their location.

✅ 70% of employees have access to data that they shouldn’t (source)


snowflake icon
redshift icon
bigquery icon
synapse icon
postgreSQL icon
mysql icon
databricks icon
dbt icon
looker icon
tableau icon
powerbi icon
slack icon

Get in Touch to Learn More

See Why Users Love CastorDoc
Fantastic tool for data discovery and documentation

“[I like] The easy to use interface and the speed of finding the relevant assets that you're looking for in your database. I also really enjoy the score given to each table, [which] lets you prioritize the results of your queries by how often certain data is used.” - Michal P., Head of Data