Skip to main content Skip to complementary content

Dataset detailed view

When clicking a dataset from the dataset list, you access its detailed view.

Information noteNote: This feature becomes available for Talend Cloud Pipeline Designer and Talend Cloud Data Preparation users when Talend Cloud Data Inventory is enabled for the account.
The detailed view allows you to have a clear vision of your dataset and its content, metadata, quality, and other properties, in addition to the social and collaborative features. For any given dataset, the detailed view gives you access to the following panels:
  • The Dataset overview
    Dataset overview

    This is the first panel that opens when clicking a dataset from the list. You can get several information at a glance, including the dataset metadata and a simple quality indicator.

  • The Dataset sample
    Dataset sample

    This is where you can directly visualize your data in different forms, check its quality, and even change the semantic category of your columns.

  • The Managing Data APIs
    API

    This menu allows you to enable an API to easily share the content of the dataset to the consumers of your choice.

  • The Dataset properties
    Dataset properties

    On this page, you can view and edit the properties of your dataset, and generate a new sample based on the new configuration.

Dataset overview

When selecting a dataset from the list, the dataset overview panel opens, displaying different information and metadata.

Information noteNote: This feature becomes available for Talend Cloud Pipeline Designer and Talend Cloud Data Preparation users when Talend Cloud Data Inventory is enabled for the account.
The information that you can find at a glance, is structured in the form of tiles:
  • Talend Trust Score™: Visualize the Talend Trust Score™ of your dataset around five metrics axis and learn how to improve its global trustworthiness.
  • Data quality: Get a quick look at the quality of your data with dedicated bar charts that show the repartition of empty, invalid, and valid values across the entire dataset.
  • Data quality rules: List of rules applied to this dataset. Each compliance bar lets you see the repartition of invalid, non-applicable and valid values.
  • Schema: See the list of columns that make up the structure of your dataset, as well as the semantic type and quality for each column.
  • Preparations: List of preparations that use this dataset as source, as well a list of preparations that are compatible with this dataset and can be directly applied.
  • Pipelines: List of pipelines that use this dataset as source or destination.
  • Rating: This tile allows you to apply or edit your individual rating, as well as having access to the global rating of the dataset.
  • Description: The optional description that you entered during the dataset creation can be found here. It can also be edited to include any other context information you want to share on this dataset.
  • Custom attributes: All the custom attributes definitions that have been created for the tenant are regrouped in this tile. From there, you can apply a value to any of the categories or modify an existing one to complete the dataset metadata.
  • Tags: Easily apply tags to better document your dataset and improve its searchability.
  • API: This tile is visible for compatible datasets. It allows you to enable an API, so that consumers can get the dataset information, and monitor its activity.
  • Details: This tile regroups the basic information about the dataset creator, the creation and last modification dates, as well as who modified it.
Dataset overview panel
Dataset overview panel showing Talend Trust Score™ information, data quality, data quality rules, as well as the schema of a dataset.

Dataset sample

After creating a dataset, you can visualize and understand its content via the sample view.

Information noteNote: This feature becomes available for Talend Cloud Pipeline Designer and Talend Cloud Data Preparation users when Talend Cloud Data Inventory is enabled for the account.

Talend Cloud Data Inventory can display a sample of 10,000 records of your datasets. It includes Dataset quality at the dataset and column level, and you will also be able to Changing the semantic type of a column so that the data is well defined.

The Head sample is selected by default, it displays the first 10,000 records of your dataset. If you need to work on a more representative sample of your dataset, click the arrow next to Head sample and select Random sample to display 10,000 randomly selected records.
Airlines dataset sample
Aircrafts dataset sample. The mouse hovers on a drop-down list where you can choose between 'Head sample' and 'Random sample' options.

Dataset properties

The dataset definition and properties can be checked at any time in the dataset properties panel.

Information noteNote: This feature becomes available for Talend Cloud Pipeline Designer and Talend Cloud Data Preparation users when Talend Cloud Data Inventory is enabled for the account.

The dataset properties can be accessed from the following locations:

  • From the dataset list:
    Dataset list showing a selected dataset with the 'Edit this dataset' option highlighted.
  • From the dataset detailed view:
    Properties of the 'Aircrafts' dataset.

This page is a direct way to check, or modify some fields that have been filled during the dataset creation, and that need to be updated.

The properties that are available in the form depend on the dataset type, and can include for example:

  • The name of your dataset
  • The input of your test datasets
  • The record and field delimiter, enclosure, and escape character, or encoding of a CSV file
  • The table and query for database datasets
  • The Salesforce modules, columns, and conditions
  • The HDFS URL

You can click the View sample button to preview a few records of the new sample before validating and generating it.

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – please let us know!