This step-by-step guide will help you set up + connect a new BigQuery warehouse with the Houseware application. After completing the steps in this guide, you can view all selected product events on your Houseware instance to start analyzing and visualizing them.
- You will see the page below on logging in for the first time (before setting up). Choose the "Plug Houseware to work on your Cloud data warehouse" option here.
- Next, select BigQuery as the data warehouse you want to connect on reaching the following page.
To note before connecting
Before starting the connection process, make sure that you have the necessary access to create and manage service accounts (A service account is needed to give access to Houseware to operate on data). Check out the section below to know how to get the correct permissions.
To get the permissions that you need to manage access to a project, folder, or organization, ask your administrator to grant you the following IAM roles on the resource that you want to manage access for (project, folder, or organization):
- To manage access to a project: Project IAM Admin (
- To manage access to a folder: Folder Admin (
- To manage access to projects, folders, and organizations: Organization Admin (
- To manage access to almost all Google Cloud resources: Security Admin (
Now with the necessary permissions, we can move on to creating a warehouse connection.
For Houseware to interact with the service account of your google cloud project, it needs to connect securely with the account's service identity. This is done by creating a private service account key in .JSON format, which is to be uploaded on the Houseware app.
- Go to your google cloud console
- Select your organization and project from the top left
- Go to the navigation menu and go to Service accounts
- In the service accounts section, choose the service account for which they key needs to be generated and click on manage keys
- Click on create a new key
- Create a JSON keyfile, and upload it on Houseware app
The next step is to ingest a GCP bucket to ingest data from Fivetran (our data connector). Also, ensure the bucket is in the exact location of the dataset(US). You will also have to change bucket access to fine-grained
What's a bucket and what exactly are we doing
A bucket is nothing more than a container for your data. Everything that you store on google cloud storage resides in a bucket. You can use buckets to organize your data and control access to your data. We would be needing Storage Object Admin permission for the bucket to read and write data from the bucket. All the models and metrics you create get stored in your own warehouse this way.
Access Control: Fine-Grained
When data is stored together in the cloud, fine-grained access control is essential since it allows data with different access requirements to ‘live’ in the same storage space without running into security or compliance issues. Fine-grained access control uses more nuanced and variable methods for allowing access. Read More.
- Click on the top-left menu and go to the Cloud Storage section
- Go to your bucket and open settings, click on edit access
- Make sure the access control setting is configured to fine-grained
- We need to set up Storage Object Admin for the bucket now, click on add principal
- Here select the role of the principal as Storage Object Admin. Go to Cloud Storage -> Storage Object
Enter the bucket name on Houseware!
Step 4: Name the schema on which you want Houseware to materialize the transformed tables and metrics
Click on the Connect button to complete the connection.
Your BigQuery is now connected to your Houseware application!
Updated 4 months ago