12.Data Engineer (AWS stack)
About PayPay India
Why India ?
To build our Payment services, we got technical cooperation from Paytm (A large payment service company in India). And based on their customer-first technologies , we created and expanded the smartphone payment service in Japan. Therefore, we have decided to establish a development base in India, because it is a major IT country with many talented engineers, as evidenced by the fact that cutting-edge mobile payments can continue to be generated.
OUR VISION IS UNLIMITED
Job Description
PayPay India is looking for Platform Data Engineer who can develop and deploy data pipelines and manage data at scale on mission critical systems.
Main Responsibilities :
- Design, build and maintain ETL/ELT data pipeline to process and ingest data from multiple sources.
- Work with structured and semi/un-structured data.
- Utilizing your skills in engineering best practices to solve complex data problems.
- Advising on specific technologies and methodologies for utilising cloud resources to efficiently ingest and process data quickly.
- Collaborating with teams to ensure that prospective data architecture plans maximize the value of data across the organization.
- Document all data integration processes , workflows and technical & system specifications.
- Ensure compliance with data governance policies , industry standards and regulatory requirements.
Tech Stack :
We select the best combination of tech at times.
| Python, Shell, Go, SQL, PL/SQL
| Aurora, MySQL, TiDB, Redis, DynamoDB
| AWS
| GitHub, Terraform, Jenkins, Ansible, Flyway
| Victoria Metrics, Grafana, Prometheus, Newrelic
| Elasticsearch , Opensearch , Athena , Glue , EMR
Qualifications
- More than 3 years of AWS experience
- 4+ years experience as a Data Engineer or in a similar role
- Strong problem-solving skills and ability to troubleshoot complex data issues
- Hands-on experience with Infra as Code (IaC - terraform)
- Excellent communication and collaboration skills
- Ability to work in a fast-paced, dynamic environment and manage multiple tasks simultaneously
Preferred Qualifications
- Experience designing and building solutions utilising various Cloud services such as EC2 , S3 , EMR , Kinesis , Glue , Athena , Lambda , Redshift , etc.,
- Design and implement data integration workflows and Ensuring data quality and integrity across systems.
- Validate and cleanse structured and semi/un-structured data to maintain high data quality.
- Monitor and optimize the performance of Elasticsearch / Opensearch clusters and Kafka clusters
- . Experience in at least one primary language (e.g. Scala, Python, Java) and SQL (any variant).
- Stay update on cloud security best practices and apply them to the organization's environments.
Nice to Have :
- Design and implement Elastic search / Open search clusters to meet the organization's search and analytics requirements.
- Experience in operating and managing search and analytics engines like Elastic search , Open search , Logstash and Vector.
- Design, implement and manage Kafka-based data pipelines and messaging solutions to support critical business operations and enable real-time data processing.
- Experience in distributed event streaming platform and data processing framework.
- Familiarity with Jenkins, Ansible, Docker, Kubernetes.
- Experience in operating distributed systems.
Remarks
*Please note that you cannot apply for PayPay (Japan-based jobs) or other positions in parallel or in duplicate.
PayPay 5 senses
- Please refer PayPay 5 senses to learn what we value at work.
Working Conditions
Employment Status
- Full Time
Office Location
- Gurugram (Wework)
*The development center requires you to work in the Gurugram office to establish the strong core team.
Apply for this job
*
indicates a required field