ESP Journal of Engineering & Technology Advancements |
© 2021 by ESP JETA |
Volume 1 Issue 2 |
Year of Publication : 2021 |
Authors : Nishanth Reddy Mandala |
: 10.56472/25832646/JETA-V1I2P123 |
Nishanth Reddy Mandala, 2021. "ETL in Data Lakes vs. Data Warehouses", ESP Journal of Engineering & Technology Advancements, 1(2): 224-230.
The Extract, Transform, Load (ETL) process plays a pivotal role in data integration, enabling businesses to consolidate data from disparate sources into a unified system for analysis. This paper presents a comparative analysis of ETL processes in Data Lakes and Data Warehouses, focusing on the architecture, performance, and flexibility of each approach. The discussion highlights the distinct roles of data lakes, which handle large volumes of unstructured and semi-structured data, versus traditional data warehouses, which are optimized for structured, relational data. Several case studies and performance evaluations are included to illustrate the strengths and weaknesses of both architectures.
[1] R. Kimball, The Data Warehouse Toolkit: Practical Techniques for Building Dimensional Data Warehouses, Wiley, 1996.
[2] W. H. Inmon, Building the Data Warehouse, John Wiley & Sons, 2002.
[3] A. Silberschatz, H. F. Korth, and S. Sudarshan, Database System Concepts, 5th ed., McGraw-Hill, 2006.
[4] A. Rudra and S. Yeo, ”Data Warehousing and ETL: Theory and Practice,” in International Conference on Information Systems and Data Warehousing, IEEE, 2009, pp. 100–109.
[5] A. Datta and H. Thomas, ”Data Integration Using ETL Technology,” Journal of Database Management, vol. 16, pp. 75–91, 2005.
[6] C. S. Jensen, T. B. Pedersen, and C. Thomsen, ”System Support for ETL Processes,” in ACM Transactions on Database Systems, vol. 29, pp. 33–65, 2004.
[7] D. Brown and K. Lee, ”Data Warehouse Optimization: A Practical Guide,” in Data Warehousing and Knowledge Discovery Conference, Springer, 2008, pp. 145–156.
[8] P. Gupta and M. Jain, ”Blockchain for Secure Decentralized Transactions: A Review,” International Journal of Computer Applications, vol. 12, pp. 105–112, 2010.
[9] H. Finn and R. Cheng, ”Data Transformation Techniques in ETL Systems: An Evaluation,” Journal of Computing Research, vol. 10, pp. 58–69, 2007.
[10] R. Kimball, ”Data Warehousing and Business Intelligence,” Journal of Data Management, vol. 11, pp. 55–75, 1998.
ETL, Data Lakes, Data Warehouses, Big Data, Data Processing, Data Integration, Data Architecture.