Large Scale Data Analytics
By: Chung Yik Cho; Rong Kun Jason Tan; John A. Leong; Amandeep S. Sidhu
Publisher: Springer
Print ISBN: 9783030038915, 3030038912
eText ISBN: 9783030038922, 3030038920
Copyright year: 2019
Format: EPUB
Available from $139.00 USD
SKU: 9783030038922

This book presents a language integrated query framework for big data. As data volumes grow continuously and rapidly to terabytes (1,024 gigabytes) or petabytes (1,048,576 gigabytes), the need for a system to manage and query information from large scale data sources is becoming more urgent. Currently available frameworks and methodologies are limited in efficiency and in querying compatibility between data sources because of differences in how the information is stored. For this research, the authors designed and programmed a framework based on the fundamentals of language integrated query that queries existing data sources without restructuring the data. A web portal for the framework was also built to enable users to query protein data from the Protein Data Bank (PDB), and it was implemented on Microsoft Azure, a cloud computing environment known for its reliability, vast computing resources and cost-effectiveness.