Updated April 7, 2023
Introduction to Tableau
A data visualization tool developed in Salesforce to connect with any database, be it SQL or MongoDB, and interact freely is called Tableau. It is mostly used in the Business Intelligence industry, and raw data is simplified easily to any format understandable by the users. Visualizations are made in the form of dashboards, and data should be represented in tabular format. It is also used in reporting and is mostly called a reporting tool. It helps to explore data, visualize and prepare reports for the same data. It is developed using C++.
Need, Scope and Features
The overwhelming growth rate of available data volumes and the pressing need for the decision-makers in all domains of business and research to make quick and accurate decisions increased the scope for data visualization tools to understand data with graphics. As a result, the requisite efforts and time to create and maintain graphics packages and the level of expertise to understand graphics jargon is greatly reduced.
The salient features offered by Data Visualization tools are:
1. First, ease use of tool functionalities avoids the need for in-depth training on technology for the creation or interaction of views in order to derive valuable insights.
2. Scalability to the rapidly growing data size and to the increased data complexity comprising scalar, vector, and tensor data types without compromising the performance of the tool.
3. Ability to handle heterogeneous data by possessing data-connectors or parsers for various sources that hold or store.
- Structural data as in Relational Databases (Oracle, MySQL, PostgreSQL).
- Semi-structured data like XML files, JSON payloads.
- Unstructured data in NoSQL databases like MongoDB, Couchbase, etc.
- Cloud data storage like Oracle Cloud, AWS and Microsoft Azure, etc.
4. Integration of view manipulation features like slicing and dicing, roll up to the required level of detail, filtering the data to help in detailed explorative data analysis.
5. Support for semantic visualization to perform an automatic selection of suitable visualization primitives such as line charts, bar charts, scatter plots, Tree diagrams, etc., for nominal, ordinal, and numerical data.
One of the most popular data visualization tools proved to be the best in visual-based data exploration.
Working with Tableau
A brief description of key concepts to work is:
1. Dimensions and Measures
It classifies the data into Dimensions and Measures. The independent fields that cannot be aggregated are considered as Dimensions and those fields that can be aggregated and are context-dependent are referred to as Measures.
For example, consider the Superstore dataset in which Region, Year are dimensions that provide context to add meaning to the Sales figure, which is a summarizable measure.
2. Level of Detail
The three expressions supported by Tableau to traverse levels of detail are:
- INCLUDE: Aggregation is performed post grouping the INCLUDED list of dimension fields.
{ INCLUDE [Field 1],[Field 2]: SUM([column]) }
- EXCLUDE: Aggregation is performed post grouping the dimension fields that are NOT present in the EXCLUSION list.
{EXCLUDE [Field1], [Field2], [Field3] : sum([column])}
- FIXED: Unlike Include and Exclude expressions, FIXED aggregated values are insensitive, i.e. remain unchanged to the altered order of displayed dimensions (addition of new dimension or hiding existing dimension) in the view.
{ FIXED [Field 1]:SUM([column])}
3. Parameters
Visualization interaction can be enhanced by parameterizing a few aspects of the view.
The common use-cases that need usage of parameters are:
- The inclusion of Top-N Filters.
- Bin Size adjustment of histograms.
- Reference line definition for threshold-based classification.
The range or list of parameter values users can change or choose should be of the same data type as defined in parameter properties.
Generally, the process for the parameters used in Tableau involves the below steps:
- Create New Parameter: Every instance of entering a constant during view creation supports the option – ‘create new parameter’ to enable tuning of the concerned aspect in the published view.
- Show Parameter Control: Parameter controls like Filter boxes, gauges, sliding bars, etc., can be made visible in the view by selecting the ‘show parameter control’ by right-clicking against the parameters in the parameter pane.
- Use Parameter in Calculations: Parameter values returned from the parameter control can be used in calculations, functions or in logical statements so as to provide a provision to users for depicting desired measure/statistic.
4. Sets
Sets are manually-created or conditionally computed subsets of input data to facilitate customized depiction elements in the view.
The features of the sets are:
- Dynamic Groups: Unlike categories due to a grouping that is static and can be created within the dimensions having a large membership, computed sets based on the conditional logic defined on either dimensions or measures dynamically categorizes fields whenever underlying data changes.
- Reusability: Though filters support the conditional categorization of dimensions or measures, the scope of filters is limited to that dimension in the workbook, whereas sets can be saved and reused in calculations or as the component in another set.
- Order of Precedence: Sets have higher precedence than dimension filters or measure filters in the query pipeline so that the list of values displayed to choose in the filters of view will be optimum as per set definition.
5. Table Calculations
Table Calculations operates on data locally visible in the Tableau view to derive measures like running totals within sub-categories, cumulative sum, moving average with tunable window length, the ranking of field values, per cent differences, etc.
Tableau Products
Various products are categorized into:
- Visualization Development products that include Tableau Desktop and Tableau Public (Desktop).
- Visualization Publishing products that include Tableau Server, Tableau Reader, Tableau Online, Tableau Public(Server).
1. Tableau Desktop
Tableau Desktop allows users to create, format and integrate various interactive views and dashboards using the rich set of visualization primitives. It also supports live-up to-date analysis by querying data residing in various native and live-connected databases.
The created visualizations are published by sharing the Tableau Packaged Workbook with extension .twbx that comprises of:
- Tableau Workbook with extension .twb which is an XML document describing visualization templates.
- Tableau Data Extracts with extension .tde file, which is a compressed data source file.
- Other supporting files like images etc.
2. Tableau Server
Tableau Server is the reliable, secure and well-governed enterprise-level environment to share and publish visualizations created using Tableau Desktop. This product acts as a central repository for various data sources in the data engine, users’ security roles and access privilege details and all visualizations across the entire firm.
3. Tableau Public
Tableau Public is a cloud-hosted free version with tool-usage limitations and is bundled with two subproducts, namely Tableau Public (Desktop) and Tableau Public (Server). The views are created in Tableau Public (Desktop) which are saved and published in Tableau Public(server).
The notable limitations with Tableau Public are:
- Locally available data extracts are the only data sources supported.
- The maximum limit on the number of rows to be inputted is one million.
- Unlike Tableau Desktop, users cannot save the report locally and are restricted to save the workbook in Tableau Public(Server) which is accessible to all of its user.
4. Tableau Online
Tableau Online is a cloud-hosted view sharing platform with the ability to connect with cloud databases like Amazon Redshift, Google BigQuery, etc. Scheduled refresh of extracts and live connection with on-premises datastores are done using tableau bridge.
- The inherent benefit offered by the cloud bases solution is reduced infrastructure cost for its users by avoiding associated upgrades, patching and maintenance activities.
- Unlike Tableau Server, editing of workbooks and visualizations need data server connection, and these operations are limited by maximum bound on row count.
5. Tableau Reader
It is a desktop application that enables users to open and perform view interactions like drill-down and roll-up of OLAP cubes, filtering on dimensions, etc. but cannot edit the embedded content in the published visualizations which are built-in Tableau Desktop.
Conclusion
It gained positive customer experience and positioned itself as a leader in Business Intelligence (BI) vendor space. The remarkable capability of translating raw data to visual representations was achieved by offering a large set of data connectors and an easily accessible rich set of visualizations primitives. Amid changing customer needs and growing user expectations, it is aiming to support machine-learning-enabled data preparation and smart data discovery.
Alongside, it is focusing on addressing challenging gaps in enterprise features which are listed below:
- Comprised performance while integrating large and complex data from heterogeneous data need to be addressed.
- Planned to improve the ability to handle large in-memory extracts by replacing TDE data file formats with an in-memory engine named as Hyper.
- Provision of Event-based scheduling and conditional alerts to be featured.
- Development of API interface services for better extensibility.
- Reduced complexity in packaging to perform large deployments of lightweight customers.
Recommended Articles
This has been a guide to What is Tableau? Here we discussed the key concepts, working, features, and products of Tableau. You can also go through our other suggested articles to learn more –