Data & Knowledge Engineering
School of ITEE

Keynote Speech


Venue: Theatre 310

Time: 9:30-10:30, Day 1, April 13 (Mon)

Keynote Speech: Challenges in Scalable Platforms for Social Event Identification with a Case Study

by Dr Gene Guogen Zhang

Abstract: In this talk, I will present a scalable big data platform designed to analyze large volumes of data for social event identification as well as other analytics. The platform consists of HDFS and local file systems that store unstructured and semi-structured data from surveillance and social networking, together with a database that holds structured data from operational sources such as carriers, banks, and transportation. It follows the so-called Lambda architecture, which incorporates streaming for real-time data ingestion and near-real-time query support. After pre-processing, most of the data becomes structured, and a shared-nothing MPP database can then deliver high performance. I will use a public safety application built on Huawei's FusionInsight platform with MPPDB and SQL-on-Hadoop as an example to illustrate how a scalable social event identification application could be constructed today. I will also point out the many challenges that remain for a more advanced system.

Bio: Dr. Gene (Guogen) Zhang is Chief Architect of Databases at Huawei, responsible for database and big data management technology and products, supporting challenging customer solutions. He was a Distinguished Engineer in DB2 for z/OS development at IBM Silicon Valley Lab before joining Huawei. At IBM, he was the chief architect and development lead responsible for pureXML and SQL query technology in DB2 for z/OS, delivering important functionality including pureXML, complex queries, and analytics features. He has been a frequent speaker at many conferences and has been deeply involved in providing solutions to customers worldwide. Dr. Zhang was also an IBM Master Inventor.