Watch this video to learn more about Citi
Job Details
Senior Data Engineer
We are seeking a highly skilled and motivatedÌýSenior Data EngineerÌýto design, develop, and implement cutting-edge data engineering solutions using modern big data and cloud technologies. In this role, you will collaborate with product owners, data scientists, analysts, and technologists to deliver scalable, high-performance data products in an agile and collaborative environment. You will also play a key role in migrating legacy workloads to the cloud, optimizing data pipelines, and mentoring team members on best practices in data engineering.
Key Responsibilities
- Design and develop scalable big data solutions using platforms like Hadoop, Snowflake, or other modern data ecosystems.
- Collaborate with domain experts, product managers, analysts, and data scientists to build robust and efficient data pipelines.
- Lead the migration of legacy workloads to cloud platforms (AWS, Azure, or GCP) while ensuring seamless integration and optimization.
- Develop and implement cloud-native solutions for data processing and storage.
- Partner with data scientists to build data pipelines from heterogeneous sources and provide engineering support for data science applications.
- Enable advanced analytics and machine learning workflows by delivering high-quality data pipelines.
- Implement CI/CD pipelines to automate data engineering workflows across cloud and on-premises platforms.
- Drive automation to improve efficiency and reduce manual intervention in data processes.
- Research and evaluate open-source technologies and recommend their integration into the data platform to enhance functionality and scalability.
- Act as a technical expert and mentor team members on big data and cloud technologies.
- Define and enforce coding standards, reusable components, and consistent patterns for data engineering processes.
- Convert SAS-based pipelines into modern frameworks like PySpark, Scala, or Java for execution on Hadoop and non-Hadoop ecosystems.
- Optimize big data applications for performance and scalability across platforms.
- Analyze evolving business requirements and recommend enhancements or alternatives to current systems.
- Evaluate new IT developments and industry standards to ensure the data platform remains cutting-edge.
- Foster a collaborative and high-performing team environment.
- Ensure compliance with applicable laws, regulations, and organizational policies.
- Apply sound ethical judgment and escalate control issues transparently.
Qualifications
- 8+ years of experience with Hadoop (Cloudera) and big data technologies.
- Advanced knowledge of the Hadoop ecosystem, including HDFS, MapReduce, Hive, Pig, Impala, Spark, Kafka, Kudu, and Solr.
- Proficiency in Java, Python, or Scala.
- Hands-on experience with Spark programming (PySpark, Scala, or Java).
- Familiarity with Apache Beam is a plus.
- Experience with cloud platforms like AWS, Azure, or GCP.
- Proven ability to deploy and manage data solutions on cloud platforms.
- Expertise in designing and developing data pipelines for ingestion, transformation, and processing.
- Experience with Snowflake or Delta Lake is a strong advantage.
- Hands-on experience with containerization tools like Docker and Kubernetes.
- Proficiency in DevOps practices, including source control, CI/CD, and automated deployments.
- Experience with Python libraries for machine learning and data science workflows.
- Strong knowledge of data structures, algorithms, distributed storage, and compute systems.
- 1+ year of SAS experience preferred.
- 1+ year of Hadoop administration experience preferred.
- Strong problem-solving and analytical skills.
- Excellent interpersonal and teamwork abilities.
- Proven leadership experience, including mentoring and managing a team of data engineers and analysts.
- A proactive, "can-do" attitude for solving complex business problems.
Education
- Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).
This revised job description is concise, well-structured, and highlights the key responsibilities, qualifications, and benefits of the role. It is tailored to attract experienced data engineers with expertise in big data, cloud platforms, and leadership.
------------------------------------------------------
Job Family Group:
Technology------------------------------------------------------
Job Family:
Data Science------------------------------------------------------
Time Type:
Full time------------------------------------------------------
Most Relevant Skills
Please see the requirements listed above.------------------------------------------------------
Other Relevant Skills
For complementary skills, please see above and/or contact the recruiter.------------------------------------------------------
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
Ìý
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review .
View Citi’s and the poster.
About Citi Working at Citi is far more than just a job. A career with us means joining a team of more than 200,000 dedicated people from around... Read more