Data compare
Note: This feature is available in Enterprise, Ultimate and Team editions only.
Table of contents
Overview
When you work with data, it is common to encounter scenarios where two tables contain almost identical datasets with minor differences. Comparing data across these tables can be crucial for several tasks:
- Visualizing and navigating differences quickly.
Exporting the differing data.
Steps for comparing data
Choose tables
- First, select the tables you wish to compare from the Database Navigator.
Tip: You can compare tables from the same or different databases, or even from different RDBMS, such as PostgreSQL and MySQL.
- Navigate to Database -> Compare/Migrate -> Data Compare in the menu. This will open the Data Compare wizard. At this stage, confirm your choice of source and target tables for the comparison.
Keys configuration
Select the columns that will serve as the unique key for the comparison. Ensure that the number of columns selected is equal. Choosing the wrong keys may result in invalid comparison results.
Note: By default, if the table has a unique key, it will be selected automatically during the initial setup. Without unique keys, rows cannot be properly compared as there is no way to distinguish between individual rows.
Column mapping
If the tables you are comparing have different ordering, naming, or quantities of columns, you can adjust their mappings or skip columns that are not of interest.
You can use the Auto assign button to automatically link columns with the same names.
Note: This page will only feature columns that are not used as the primary key configured on the previous page.
Compare settings
On the final page of the Data Compare wizard, configure settings to tailor the data comparison to your needs:
Setting | Description |
---|---|
Open viewer on finish | Opens the compare viewer for a visual overview of the differences once the comparison is complete. |
Export compare results to file | Exports the SQL script diff to a file. You can choose the output format and specify the output file path. |
Limit compared rows to | Limits how many rows must be compared against each other before ending the comparison. |
Limit different rows to | Limits how many rows must be differ between each other before ending the comparison. |
Limit fetch size to | Determines how many rows should be fetched at once, which helps manage memory and performance during comparison. |
Show modified rows | Determines whether modified rows should be included in the diff or not, allowing you to track changes in the data. |
Show deleted rows | Determines whether deleted rows should be included in the diff or not, providing insight into data that has been removed from the database. |
Show inserted rows | Determines whether newly inserted rows should be included in the diff or not, allowing you to see additions to the database. |
Store results in memory | Stores results in memory for further processing. Note that DBeaver might run out of memory when comparing large tables if this option is enabled. When disabled, results will be stored in a temporary database. |
Open new connections | Opens new physical connections for data reading, which can help manage resources and ensure the stability of data comparison operations. |
Query rows count | Queries row count before performing compare. This lets you track the progress of the comparison but may cause performance faults in some cases. |
Save task | Opens the Save Task window to assist in creating a task during the compare process. This window provides options and settings for creating and configuring a task related to the compare operation. |
Tip: You can export file to a remote file system via Cloud Storage using the Browser remote file system button .
Execution log
Once you have configured all settings and clicked Proceed, the Data Compare wizard will perform the comparison operation:
- An Execution Log window will display the progress and outcome of the comparison process.
- Simultaneously, a separate window will open showing the comparison results. This window provides a detailed view of differences between the datasets based on the specified settings.
Viewing the results
After the comparison process is complete, you can view the results in the viewer. It highlights the differences between the datasets in detail, with each row that shows a discrepancy being clearly marked.
Note: A complete visual preview of the results is available only when Data Compare is launched through the wizard. If launched as part of a task, the visual editor will not open. Instead, only statistics will be recorded in the execution log.
Additional tools
The viewer not only displays the differences but also provides tools to manage and interpret these differences effectively. Here's what you can do within the compare viewer:
Feature | Button image | Description |
---|---|---|
Export changes | The results of the Data Compare can be exported as an SQL script, which includes DELETE , INSERT , and UPDATE statements to make the data in the second table match the first. Alternatively, you can export the results in an HTML format as a table for easy viewing and sharing. | |
Swap tables | The Swap tables button allows you to switch the positions of the source and target data panels within the viewer. | |
Show summary | By clicking the Show Summary button, you can access a statistical summary and analysis of the differences found between the datasets. This summary provides a quick overview of the total changes, including the number of modified, deleted, and inserted rows. |
Tip: You can also export the comparison results table through Export data button in the bottom toolbar or copy rows or individual values that differ.