Castor and BigQuery


BigQuery, Google's serverless, highly scalable enterprise data warehouse, is designed to make data analysts more productive.

Why BigQuery and Castor?

BigQuery is Google Cloud's fully managed, petabyte-scale, and cost-effective analytics data warehouse that lets you run analytics over vast amounts of data in near real-time.

You might have hundreds or thousands of data tables on BigQuery. Discovery, documentation, ownership and governance are hard with fast-growing data assets and data teams. Castor is a plug-and-play layer that helps your data team power self-service BI.

What does Castor help you with?

Castor enables you to scale your self-service analytics strategy without losing control. It is designed with real use cases in mind:

🔎   You work with data you don't know

Your boss asks you to build a report on "Churn for Premium Users in 2021". You need to find the relevant dataset, understand the meaning of its columns, and use it fast.

✅ Reduce the time to find the right data asset by 95% (source: Lyft)

🧬   A key employee is leaving

Mike, the data engineer who built the entire data infrastructure, is leaving at the end of the month. All the knowledge is in his head. He needs to write it down.

✅ 42% of the work is not recovered without knowledge management (source: 360Learning)

👩🏽‍🌾  A new employee is onboarding

Elsa, data analyst, arrived last week. She has no idea what data the company stores or how it is used. She spends hours asking around to gather knowledge.

✅ New hires are autonomous after day 1

💣  A data pipeline is late

Nelson, customer success analyst, refreshes the "daily active users" dashboard every two minutes. The data hasn't arrived yet. He wants to know what is happening.

✅ 5x fewer Slack messages on #ask_data

🗺️  No one knows where personal information is

Camila, from data governance, has to map all personal information to comply with GDPR requirements. She needs a list of all data assets and their location.

✅ 70% of employees have access to data that they shouldn’t (source)

BigQuery Setup

You need permission to create a Google service account for Castor. If you don't have it, ask someone in your organisation who does.

Create a service account for Castor from the Google Cloud Console (see Google Cloud's documentation on creating service accounts).

Grant this new service account the following roles (see Google Cloud's documentation on granting IAM roles).

BigQuery Read Session User

Why: To access the cluster via SQL. We use BigQuery read sessions to read metadata, which requires the bigquery.readsessions.* permissions. The readSessionUser role allows reads via SQL but does not by itself grant any data access.

BigQuery Metadata Viewer

Scope: all the projects whose metadata we need to access

Why: to fetch the schemas of your data

BigQuery Job User

Scope: all the projects whose query log we need to fetch

Why: to parse queries and compute lineage and usage stats

BigQuery Resource Viewer

Scope: all the projects whose query log we need to fetch. This role is necessary to access query logs.

Why: to parse queries and compute lineage and usage stats
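As a rough illustration of the granting step, the sketch below builds the `gcloud projects add-iam-policy-binding` commands for the four roles listed above. The service account email and project IDs are hypothetical placeholders, and for simplicity the sketch grants every role on every project, whereas the scopes above may differ per role. It only prints the commands; nothing is executed against Google Cloud.

```python
import shlex

# Predefined BigQuery role IDs matching the four roles listed above.
CASTOR_ROLES = [
    "roles/bigquery.readSessionUser",
    "roles/bigquery.metadataViewer",
    "roles/bigquery.jobUser",
    "roles/bigquery.resourceViewer",
]


def grant_commands(service_account_email, project_ids, roles=CASTOR_ROLES):
    """Return one gcloud command string per (project, role) pair."""
    commands = []
    for project_id in project_ids:
        for role in roles:
            args = [
                "gcloud", "projects", "add-iam-policy-binding", project_id,
                f"--member=serviceAccount:{service_account_email}",
                f"--role={role}",
            ]
            # shlex.join quotes arguments safely for a POSIX shell.
            commands.append(shlex.join(args))
    return commands


if __name__ == "__main__":
    # Hypothetical service account and project, for illustration only.
    for command in grant_commands(
        "castor@my-project.iam.gserviceaccount.com", ["my-project"]
    ):
        print(command)
```

Reviewing the printed commands before running them is one way to confirm that only these metadata-oriented roles are granted.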

These roles only allow Castor to see the data model, the queries and the users, by running queries on the metadata. They are the minimum roles that let us read your metadata without being able to read your data.
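To make "queries on the metadata" more concrete, here is a minimal sketch of the kind of SQL such roles permit, using BigQuery's `INFORMATION_SCHEMA` views. These are not necessarily the queries Castor runs. The project, dataset, and region names are hypothetical, and the helper functions only build query strings; they do not connect to BigQuery.

```python
def columns_query(project_id, dataset_id):
    """SQL listing column names and types (schema only, no row data)."""
    return (
        "SELECT table_name, column_name, data_type\n"
        f"FROM `{project_id}`.`{dataset_id}`.INFORMATION_SCHEMA.COLUMNS\n"
        "ORDER BY table_name, ordinal_position"
    )


def query_log_query(project_id, region="region-us", days=1):
    """SQL listing recent query jobs in a project (the query log)."""
    return (
        "SELECT job_id, user_email, creation_time, query\n"
        f"FROM `{project_id}`.`{region}`.INFORMATION_SCHEMA.JOBS_BY_PROJECT\n"
        "WHERE creation_time >= "
        f"TIMESTAMP_SUB(CURRENT_TIMESTAMP(), INTERVAL {int(days)} DAY)\n"
        "  AND job_type = 'QUERY'"
    )


if __name__ == "__main__":
    # Hypothetical identifiers, for illustration only.
    print(columns_query("my-project", "analytics"))
    print()
    print(query_log_query("my-project", days=7))
```

The first query returns table and column definitions, and the second returns who ran which query and when; neither selects rows from your tables.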

Get started in minutes, with your favorite tools


With Castor, your data is well documented and discoverable by everyone in your team.


Data documentation matters, ask our customers

Vestiaire Collective

"Castor gave us an easy and friendly tool to maintain documentation for thousands of tables. We can now leverage the work we do on data governance to give autonomy to our data people and make them more efficient"

Aleksander Djurka

Chief Information Officer

FairMoney

"We have strong hiring plans for the data team. With Castor, new hires can ramp up really fast on our data infrastructure. Also, if an experienced data player was to leave us, I know that most of his data knowledge will stay in the company"

Maxime Laurenty

Data Architect

Pumpkin

"Thanks to Castor, we managed to increase visibility on our production database.
We had trouble to know what existed, what it meant, when it was refreshed and how it was built."

Madeleine Jullien

Head of Data

Your data has never been so clear and friendly
© 2021 Castor. All rights reserved.