Data Engineer

Data Engineering

Our Mission

Our goal is for everyone to make bolder choices with their finances.

To get there, we're creating an all-in-one ecosystem of financial services for today's generation of goal-getters. That feat takes extraordinary people-those with the guts to challenge the way things are and transform them into something better.

To be part of Team Maya is to be Bolder for Better.

Description: 

The Data Engineer is responsible for the creation, maintenance, and continuous improvement of data pipelines. Part of his/her responsibilities are to implement best practices in data management practices (i.e., cleaning, validation, and transformation of data) make data into usable datasets that can easily be consumed by other teams. 

 

This role will also work closely with the software engineers, data analysts, data scientists and data governance to understand how the data behaves in its respective domain, to clarify business and technical requirements on different data use cases, and to design and create efficient and reliable data pipelines.

 

Within Data Engineering, this person will learn and adopt best practices on data management, data architecture design, and DataOps principles. May it be in Central DE or Distributed DE, a Data engineer is crucial in creating value for downstream teams that use data. 

 

Key Responsibilities:

  • Develop, maintain, and optimize data pipelines, data models, and data management solutions across data warehouses, data/delta lakes, or lakehouse environments.
  • Collaborate with upstream teams (e.g., Mesh Teams) to integrate data sources and with downstream teams to ensure data usability and accessibility.
  • Understand and adhere to existing technology standards and Data Engineering (DE) best practices.

Central DE:

  • Maintain and enhance the overall data architecture, ensuring scalability, high availability, and timely data ingestion.
  • Build and optimize data pipelines for new data sources, applying DataOps principles to ensure seamless operations and minimal disruptions.

Distributed DE:

  • Acquire and maintain deep domain knowledge of assigned data areas to inform data modeling and pipeline development.
  • Design and develop data models for Zone 2 (silver layer) and Zone 3 (gold layer), ensuring business datasets are accurate, reliable, and ready for downstream consumption.

Qualifications

  • Good working knowledge on Shell (e.g. bash, zsh) scripting
  • Good working knowledge on data manipulation (SQL statements, JSON, NOSQL query, etc.)
  • Good working knowledge on AWS services (EC2, S3, Glue Crawlers, Jobs, Batch, Athena, Lambda, etc.) or equivalent cloud offerings a big plus
  • Good working knowledge on Apache Spark using SQL/Python 
  • Good understanding of the concepts of Datawarehouse, Data Lake/Delta Lake and/or Lakehouse
  • Ability to work with other Leads to foster a culture of collaboration and teamwork

 

Central DE

  • Good knowledge on Linux/Unix Administration
  • CI/CD experience using Terraform a big plus

 

Distributed DE

  • Good working knowledge on data modeling

About Us

Maya is the all-in-one money platform that is bringing Filipinos bolder ways to master their money. It is powered by a unique integrated financial services ecosystem that addresses the ever-evolving needs of today’s generation of money makers through cutting edge technology. 

We lead millions of Filipinos — consumers, businesses, communities, and government agencies alike — into a version of the current digital economy that’s more inclusive, transparent, and empowering than ever. 

We are powered by the country's only end-to-end digital payments company Maya Philippines, Inc. and Maya Bank, Inc. for digital banking services. 

Maya Bank, Inc. and Maya Philippines, Inc. are regulated by the Bangko Sentral ng Pilipinas. https://www.bsp.gov.ph/