Energy efficiency; Physical design; Power management; Database administrators; Hardware and software; Materialized view selection; Non-functional requirements; Query performance; Redundant optimizations; Very large database; Software; Information Systems; Hardware and Architecture
Abstract :
[en] In the Big Data Era, the management of energy consumption by servers and data centers has become a challenging issue for companies, institutions, and countries. In data-centric applications, Database Management Systems are one of the major energy consumers when executing complex queries involving very large databases. Several initiatives have been proposed to deal with this issue, covering both the hardware and software dimensions. They can be classified in two main approaches assuming that either (a) the database is already deployed on a given platform, or (b) it is not yet deployed. In this study, we focus on the first set of initiatives with a particular interest in physical design, where optimization structures (e.g., indexes, materialized views) are selected to satisfy a given set of non-functional requirements such as query performance for a given workload. In this paper, we first propose an initiative, called Eco-Physic, which integrates the energy dimension into the physical design when selecting materialized views, one of the redundant optimization structures. Secondly, we provide a multi-objective formalization of the materialized view selection problem, considering two non-functional requirements: query performance and energy consumption while executing a given workload. Thirdly, an evolutionary algorithm is developed to solve the problem. This algorithm differs from the existing ones by being interactive, so that database administrators can adjust some energy sensitive parameters at the final stage of the algorithm execution according to their specifications. Finally, intensive experiments are conducted using our mathematical cost model and a real device for energy measurements. Results underscore the value of our approach as an effective way to save energy while optimizing queries through materialized views structures.
Disciplines :
Computer science
Author, co-author :
Roukh, Amine ; Université de Mons - UMONS > Faculté Polytechnique > Service Informatique, Logiciel et Intelligence artificielle ; University of Mostaganem, Mostaganem, Algeria
Bellatreche, Ladjel; LIAS/ISAE-ENSMA, University of Poitiers, Poitiers, France
Bouarar, Selma; LIAS/ISAE-ENSMA, University of Poitiers, Poitiers, France
Boukorca, Ahcene; LIAS/ISAE-ENSMA, University of Poitiers, Poitiers, France
Language :
English
Title :
Eco-Physic: Eco-Physical design initiative for very large databases
[1] A. Roukh, L. Bellatreche, A. Boukorca, S. Bouarar, Eco-dmw: eco-design methodology for data warehouses, in: Proceedings of the Eighteenth International Workshop on Data Warehousing and OLAP, ACM, 2015, pp. 1–10.
[2] T.N.R.D. Council, Scaling up energy efficiency across the data center industry: evaluating key drivers and barriers, Issue Paper. 〈 http://www.nrdc.org/energy/files/data-center-efficiency-assessment-IP.pdf〉, August 2014 (accessed 06.04.16.).
[3] E. Liebert, Five strategies for cutting data center energy costs through enhanced cooling efficiency, White Paper. 〈 http://www.emersonnetworkpower.com/documentation/en-us/brands/liebert/documents/white%20papers/data-center-energy-efficiency_151-47.pdf〉, 2007 (accessed 06.04.16.).
[4] D. Tsirogiannis, S. Harizopoulos, M.A. Shah, Analyzing the energy efficiency of a database server, in: Proceedings of the Sigmod, 2010, pp. 231–242.
[5] G.e Sustainability Initiative, I. the Boston Consulting Group, Gesi Smarter 2020: The Role Of Ict In Driving A Sustainable Future, Press Release, December 2012.
[6] Poess, M., Nambiar, R.O., Energy cost, the key challenge of today's data centers: a power consumption analysis of tpc-c results. PVLDB 1:2 (2008), 1229–1240.
[7] Tu, Y.-C., Wang, X., Zeng, B., Xu, Z., A system for energy-efficient data management. ACM SIGMOD Rec. 43:1 (2014), 21–26.
[8] Agrawal, R., Ailamaki, A., Bernstein, P.A., Brewer, E.A., Carey, M.J., Chaudhuri, S., Doan, A., Florescu, D., Franklin, M.J., Garcia-Molina, H., et al. The claremont report on database research. ACM SIGMOD Rec. 37:3 (2008), 9–19.
[9] Abadi, D., Agrawal, R., Ailamaki, A., Balazinska, M., Bernstein, P.A., Carey, M.J., Chaudhuri, S., Dean, J., Doan, A., Franklin, M.J., et al. The beckman report on database research. Commun. ACM 59:2 (2016), 92–99.
[10] Z. Xu, Y.-C. Tu, X. Wang, Dynamic energy estimation of query plans in database systems, in: Proceedings of the 33rd International Conference on Distributed Computing Systems (ICDCS), IEEE, 2013, pp. 83–92.
[11] M. Kunjir, P.K. Birwa, J.R. Haritsa, Peak power plays in database engines, in: Proceedings of the EDBT, ACM, 2012, pp. 444–455.
[12] Lang, W., Kandhan, R., Patel, J.M., Rethinking query processing for energy efficiency: slowing down to win the race. IEEE Data Eng. Bull. 34:1 (2011), 12–23.
[13] Beloglazov, A., Buyya, R., Lee, Y.C., Zomaya, A., et al. A taxonomy and survey of energy-efficient data centers and cloud computing systems. Adv. Comput. 82:2 (2011), 47–111.
[14] S. Harizopoulos, M. Shah, J. Meza, P. Ranganathan, Energy efficiency: The New Holy Grail Of Data Management Systems Research, arXiv preprint arXiv:0909.1784.
[15] G. Graefe, Database servers tailored to improve energy efficiency, in: Proceedings of the 2008 EDBT workshop on Software engineering for tailor-made data management, ACM, 2008, pp. 24–28.
[16] Iman, E., Ashraf, A., Daniel, C.Z., Calisto, Z., Recommending XML physical designs for XML databases. VLDB J. 22:4 (2013), 447–470.
[17] S. Chaudhuri, V.R. Narasayya, Self-tuning database systems: a decade of progress, in: Proceedings of the VLDB, 2007, pp. 3–14.
[18] Imene, M., Zohra, B., A survey of view selection methods. SIGMOD Rec. 41:1 (2012), 20–29.
[19] H. Gupta, I.S. Mumick, Selection of views to materialize under a maintenance cost constraint, in: Proceedings of the ICDT, 1999, pp. 453–470.
[20] Zhou, A., Qu, B., Li, H., Zhao, S., Suganthan, P.N., Zhang, Q., Multiobjective evolutionary algorithms: a survey of the state of the art. Swarm Evolut. Comput. 1:1 (2011), 32–49.
[21] Deb, K., Pratap, A., Agarwal, S., Meyarivan, T., A fast and elitist multiobjective genetic algorithm: Nsga-ii. IEEE Trans. Evolut. Comput. 6:2 (2002), 182–197.
[22] Roukh, A., Bellatreche, L., Eco-processing of olap complex queries. Big Data Anal. Knowl. Discov., 2015, 229–242.
[23] Roukh, A., Estimating power consumption of batch query workloads. Model Data Eng., 2015, 198–212.
[24] W. Lang, J. Patel, Towards eco-friendly database management systems, arXiv preprint arXiv:0909.1767.
[25] Z. Xu, X. Wang, Y.-C. Tu, Power-aware throughput control for database management systems, in: Proceedings of the ICAC, 2013, pp. 315–324.
[26] Intel, Oracle, Oracle Exadata on Intel® Xeon® Processors: Extreme Performance for Enterprise Computing, 〈 https://www.oracle.com/engineered-systems/exadata/index.html〉, 2014 (accessed 06.04.16.).
[27] L. Woods, Z. István, G. Alonso, Ibex: an intelligent storageengine with support for advanced sql offloading, in: Proceedings of the VLDB Endowment, vol. 7(11), 2014, pp. 963–974.
[28] M. Rofouei, T. Stathopoulos, S. Ryffel, W. Kaiser, M. Sarrafzadeh, Energy-aware high performance computing with graphic processing units, in: Proceedings of the Workshop on Power Aware Computing and System, 2008.
[29] Mehdipour, F., Noori, H., Javadi, B., Chapter two-energy-efficient big data analytics in datacenters. Adv. Comput. 100 (2016), 59–101.
[30] J. Do, Y.-S. Kee, et al., Query processing on smart ssds: opportunities and challenges, in: Proceedings of the ACM SIGMOD, ACM, 2013, pp. 1221–1230.
[31] P. Behzadnia, W. Yuan, B. Zeng, Y.-C. Tu, X. Wang, Dynamic power-aware disk storage management in database servers, in: Proceedings of the DEXA, Springer International Publishing, 2016, pp. 315–325.
[32] D. Schall, V. Hudlet, T. Härder, Enhancing energy efficiency of database applications using ssds, in: Proceedings of the Third C* Conference on Computer Science and Software Engineering, ACM, 2010, pp. 1–9.
[33] S.-K. Cheong, C. Lim, B.-C. Cho, Database processing performance and energy efficiency evaluation of ddr-ssd and hdd storage system based on the tpc-c, in: Proceedings of the 2012 International Conference on Cloud Computing and Social Networking (ICCCSN), IEEE, 2012, pp. 1–3.
[34] R. Appuswamy, M. Olma, A. Ailamaki, Scaling the memory power wall with dram-aware data management, in: Proceedings of the 11th International Workshop on Data Management on New Hardware, ACM, 2015, p. 3.
[35] M. Korkmaz, A. Karyakin, M. Karsten, K. Salem, Towards dynamic green-sizing for database servers, in: Proceedings of the International Workshop on Accelerating Data Management Systems Using Modern Processor and Storage Architectures (ADMS), 2015, pp. 25–36.
[36] A. Hassan, H. Vandierendonck, D.S. Nikolopoulos, Energy-efficient in-memory data stores on hybrid memory hierarchies, in: Proceedings of the 11th International Workshop on Data Management on New Hardware, ACM, 2015, p. 1.
[37] Z. Xu, Y.-C. Tu, X. Wang, Exploring power-performance tradeoffs in database systems, in: Proceedings of the ICDE, 2010, pp. 485–496.
[38] Rodriguez-Martinez, M., Valdivia, H., Seguel, J., Greer, M., Estimating power/energy consumption in database servers. Procedia Comput. Sci. 6 (2011), 112–117.
[39] Lang, W., Harizopoulos, S., Patel, J.M., Shah, M.A., Tsirogiannis, D., Towards energy-efficient database cluster design. Proc. VLDB Endow. 5:11 (2012), 1684–1695.
[40] I. Psaroudakis, T. Kissinger, D. Porobic, T. Ilsche, E. Liarou, P. Tözün, A. Ailamaki, W. Lehner, Dynamic fine-grained scheduling for energy-efficient main-memory queries, in: Proceedings of the Tenth International Workshop on Data Management on New Hardware, ACM, 2014, p. 1.
[41] W. Kang, S.H. Son, J.A. Stankovic, Power-aware data buffer cache management in real-time embedded databases, in: Proceedings of the 14th IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, 2008. RTCSA'08, IEEE, 2008, pp. 35–44.
[42] A. Roukh, L. Bellatreche, C. Ordonez, Enerquery: energy-aware query processing, in: Proceedings of the 24rd ACM International Conference on Conference on Information and Knowledge Management, ACM, 2016.
[43] Wang, J., Feng, L., Xue, W., Song, Z., A survey on energy-efficient data management. ACM SIGMOD Rec. 40:2 (2011), 17–23.
[44] Schall, D., Härder, T., Wattdb-a journey towards energy efficiency. Datenbank-Spektrum 14:3 (2014), 183–198.
[45] F. Fusco, U. Fischer, V. Lonij, P. Pompey, J.-B. Fiot, B. Chen, Y. Gkoufas, M. Sinn, Data management system for energy analytics and its application to forecasting, in: Proceedings of the Joint EDBT/ICDT PhD workshop, 2016.
[46] R. Silipo, P. Winters, Big data, smart energy, and predictive analytics, White Paper, 〈 https://www.knime.org/files/knime_bigdata_energy_timeseries_whitepaper.pdf〉, 2013 (accessed 06.04.16.).
[47] L. Siksnys, C. Thomsen, T.B. Pedersen, Mirabel dw: Managing complex energy data in a smart grid, in: Proceedings of the DaWaK, 2012, pp. 443–457.
[48] Khalefa, M.E., Fischer, U., Pedersen, T.B., Lehner, W., Model-based integration of past & future in timetravel. Proc. VLDB Endow. 5:12 (2012), 1974–1977.
[49] Fischer, U., Dannecker, L., Siksnys, L., Rosenthal, F., Boehm, M., Lehner, W., Towards integrated data analytics: time series forecasting in dbms. Datenbank-Spektrum 13:1 (2013), 45–53.
[50] D. Ameller, C. Ayala, J. Cabot, X. Franch, How do software architects consider non-functional requirements: An exploratory study, in: Proceedings of the 20th International Requirements Engineering Conference (RE), IEEE, 2012, pp. 41–50.
[52] J.C. McCullough, Y. Agarwal, J. Chandrashekar, S. Kuppuswamy, A.C. Snoeren, R.K. Gupta, Evaluating the effectiveness of model-based power characterization, in: Proceedings of the USENIX Annual Technical Conference, 2011.
[53] H. Mistry, P. Roy, S. Sudarshan, K. Ramamritham, Materialized view selection and maintenance using multi-query optimization, in: Proceedings of the ACM SIGMOD Record, Vol. 30, ACM, 2001, pp. 307–318.
[54] M. Lawrence, A. Rau-Chaplin, Dynamic view selection for olap, in: Proceedings of the International Conference on Data Warehousing and Knowledge Discovery, Springer, 2006, pp. 33–44.
[55] J. Yang, K. Karlapalem, Q. Li, Algorithms for materialized view design in data warehousing environment, in: Proceedings of the VLDB, 1997, pp. 25–29.
[56] Boukorca, A., Bellatreche, L., Senouci, S.-A.B., Faget, Z., Coupling materialized view selection to multi query optimization: hyper graph approach. Int. J. Data Warehous. Min. (IJDWM) 11:2 (2015), 62–84.
[57] P. ONeil, E. ONeil, X. Chen, S. Revilak, The star schema benchmark and augmented fact table indexing, in: Proceedings of the Performance evaluation and benchmarking, Springer, 2009, pp. 237–252.
[58] C.-L. Hwang, A.S.M. Masud, Multiple objective decision making – methods and applications: a state-of-the-art survey, vol. 164, Springer-Verlag Berlin Heidelberg, 1979.
[59] S. Borzsony, D. Kossmann, K. Stocker, The skyline operator, in: Proceedings of the ICDE, 2001, pp. 421–430.
[60] S. Barielle, Calculating tco for energy, IBM Systems Magazine, 〈 http://www.ibmsystemsmag.com/mainframe/Business-Strategy/ROI/energy_estimating/〉, November 2011 (accessed 06.04.16.).
[61] Talebi, Z.A., Chirkova, R., Fathi, Y., An integer programming approach for the view and index selection problem. Data Knowl. Eng. 83 (2013), 111–125.