Create dataset and connect it to a source
Follow these two primary steps to create a dataset and connect it to a source.
Step 1: Create a dataset
Start by giving your dataset a unique name. Once you have named it, the platform will assign it a dataset ID. This ID is not fixed and can be edited according to your needs.
Step 2: Connect your dataset to a configured source
Once your dataset has been created, the next step is to connect it to a source that has been previously configured.
If you don't have a source yet
If you have not set up a source yet, don't worry. Follow the instructions provided to configure a new source (link to documentation of connecting to a source).
Select the relevant source
From your list of configured sources, select the one that is relevant to the dataset you just created.
For S3 or GCS
Notification action selection:
Choose which action will trigger a notification to the platform. The options include either the insertion of new data or an update to the existing data.
Specify the file or folder path:
Finally, provide the specific path to the file or folder in your source that is relevant to your new dataset. The path will already contain a prefix, so you should only fill in what comes after the prefix. The provided prefix:
s3://<bucket name>
gs://<bucket name>

Updated about 1 year ago