3. märtsil kell 14.00 peab infosüsteemide dotsendi kandidaat Sherfi Sakr venia legendi loengu teemal „Big Data Systems: Research Challenges and Opportunities“.
Lühikokkuvõte inglise keeles:
For about a decade, the Hadoop framework has been recognized as the de facto standard of big data analytics and processing systems. The Hadoop framework was popularly employed as an effective solution that can harness the resources and power of large computing clusters in various application domains. Due to its wide success, popular technology companies (e.g., IBM, Oracle and Microsoft) have decided to support the Hadoop framework in their commercial big data processing platforms.
Additionally, many emerging startups such as Cloudera, MapR, Trifacta, Platfora among many others have designed their services and solutions based on the Hadoop framework. However, recently, both the research and industrial worlds identified various limitations in the Hadoop framework and thus it started to be acknowledged that the Hadoop framework cannot represent the one-size-fits-all solution for the various big data analytics challenges. In this talk, we present our view on Big Data 2.0 processing platforms which represent a new generation of engines (e.g., Spark, Flink, Giraph, GraphLab, Storm) that are domain-specific, dedicated to specific verticals (e.g. structured data, big graphs, data
streams) and slowly replacing the Hadoop framework in various contexts.
We classify these system, discuss their technical details and adequate application scenarios. We present different big data systems that have been developed through our research in this domain. In addition, we highlight some of the open challenges in the domain of big data processing systems.