A member of the Data Lake team, responsible for building, developing, and maintaining the data pipelines of Data Lake, our strategic Big Data platform, which serves the bank's management needs and business operations and complies with Data Governance.
MAIN RESPONSIBILITIES:
Develop and maintain scalable and reliable data pipelines that ingest data from a variety of sources into Data Lake, ensuring that the data is in the right format, adheres to data quality standards, and reaches downstream users quickly.
Develop and maintain a highly scalable and extensible Big Data platform that enables collection, storage, modeling, and analysis of massive data sets from numerous channels.
Define and maintain data pipelines, data structures, and data formats to enable business solutions.
Develop and enable big data and batch / real-time analytical solutions that leverage emerging technologies.
Work in a team to build next-generation Hadoop data lake and analytics applications on a group of core Hadoop technologies.
Evaluate new technologies and products, and conduct research to identify opportunities that affect business strategy, business requirements, and performance and that can accelerate access to data.
Work with the Advanced Analytics Team to plan and execute high-impact, actionable insight generation through big data advanced analytics, including predictive analytics and advanced Machine Learning technologies, reducing cost and improving speed to insight by accelerating the pace of Big Data innovation at ACB.
Ensure proper configuration management and change controls are implemented during code migration.