What is the difference between CDM and CDP?
CDP (Continuous Data Protection) records a baseline copy of the data together with a log of incremental changes, then indexes that log so that users can quickly retrieve a copy of the data as it stood at any historical point in time. However, CDP systems focus on capturing and protecting data. How that data is subsequently used has long received little attention.
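The baseline-plus-indexed-log idea can be illustrated with a minimal sketch. This is a toy model, not any vendor's implementation: the `CdpJournal` class, its method names, and the use of integer timestamps and a block dictionary are all assumptions made for illustration.

```python
from bisect import bisect_right
from dataclasses import dataclass, field


@dataclass
class CdpJournal:
    """Toy CDP store (hypothetical): one baseline image plus a time-ordered write log."""
    baseline: dict                                   # block id -> content at time 0
    timestamps: list = field(default_factory=list)   # index: sorted write times
    writes: list = field(default_factory=list)       # (block id, new content) per write

    def record(self, ts, block, content):
        """Append one incremental write; timestamps must be non-decreasing."""
        if self.timestamps and ts < self.timestamps[-1]:
            raise ValueError("writes must be recorded in time order")
        self.timestamps.append(ts)
        self.writes.append((block, content))

    def view_at(self, ts):
        """Rebuild the data as it stood at time ts (inclusive)."""
        upto = bisect_right(self.timestamps, ts)     # index lookup: how many writes apply
        image = dict(self.baseline)                  # never mutate the baseline itself
        for block, content in self.writes[:upto]:
            image[block] = content
        return image


journal = CdpJournal(baseline={0: "A0", 1: "B0"})
journal.record(10, 0, "A1")
journal.record(20, 1, "B1")
journal.record(30, 0, "A2")

print(journal.view_at(5))    # {0: 'A0', 1: 'B0'}
print(journal.view_at(25))   # {0: 'A1', 1: 'B1'}
```

The sorted timestamp list plays the role of the index: a binary search finds how much of the log applies to the requested moment, so any historical point can be reconstructed without storing a full copy per point.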
CDM (Copy Data Management) starts from a different premise. It does not care how the data was obtained, whether through traditional backup or CDP, and it does not care where the data is stored, whether on local disk, SAN, distributed storage, or cloud storage. It focuses on how to better manage and utilize the acquired data, and how to put it to use in combination with applications.
CDP targets continuous data protection, backup, and recovery. Although CDP can also provide an image of the data at a historical point in time, its original design intent, configuration process, and usage workflow are all strongly oriented toward data recovery rather than data use. CDM, by contrast, is oriented toward how the data is used.
Snapshots in traditional storage systems face the same problem. Many snapshots are taken, but they mostly serve as standby copies of data kept for emergencies. Although vendors claim that snapshots can be used in day-to-day data services, that only means you "can" use them; in practice they fall well short on ease of use.
Why does CDM exist?
With the advent of the big data era, enterprises facing market competition are beginning to consider how to increase the value they get from their data and extract the useful information hidden in it, so as to quickly strengthen their core competitiveness. Production data is not used only by production systems; many non-production environments need it as well. Developing a new system requires copying production data to the R&D environment; testing a system requires copying production data to the test environment for debugging; statistics and analysis require copying production data to a statistical analysis system for processing.
What can CDM do?
CDM can separate out copies of online data without redirecting business access, and shift query, test, and analysis traffic to non-production systems. The resulting copy of the data is available immediately, and the separated data can quickly support the business after data reorganization, application pattern matching, and data cleaning. CDM also provides process management, using automated workflows to reduce the complexity of managing copy data. In addition, CDM can improve the efficiency of data management and, combined with log processing technology, improve the timeliness of the data, and it can quickly process larger and more diverse data sets.
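One way a copy can be "immediately available" to several non-production consumers is a copy-on-write virtual copy layered over a single captured image. The text does not say which mechanism any particular CDM product uses, so the sketch below is an assumption for illustration only; `golden`, `provision_virtual_copy`, and the sample values are hypothetical.

```python
from collections import ChainMap

# Hypothetical "golden" copy captured from production (by backup, CDP, or any other means).
golden = {"orders": 1200, "customers": 310}


def provision_virtual_copy(golden_copy):
    """Return a writable view whose writes land in a private overlay dict.

    Reads fall through to the shared golden copy; nothing is duplicated up front.
    """
    return ChainMap({}, golden_copy)


test_env = provision_virtual_copy(golden)
analytics_env = provision_virtual_copy(golden)

test_env["orders"] = 0            # the test environment overwrites a value
print(test_env["orders"])         # 0
print(analytics_env["orders"])    # 1200
print(golden["orders"])           # 1200
```

Each environment sees its own changes while the golden copy and the other environments stay untouched, which is why provisioning takes no copy time in this model.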