- Company Name
- Sylorion SAS
- Job Title
- Senior Data Developer
- Job Description
-
**Job Title**
Senior Data Developer
**Role Summary**
Design, develop, maintain, and optimize data ingestion pipelines on a production Cloudera Hadoop platform. Lead technical initiatives, perform code reviews, mentor junior staff, and provide Level‑2 support while ensuring performance and reliability of the data lake ecosystem.
**Expectations**
- Minimum 7 years of data engineering experience in a production environment.
- Deep proficiency with Python, Spark, Hive, and advanced Shell scripting.
- Strong SQL expertise and performance tuning skills.
- Experience reading and analyzing YARN logs, and optimizing Hive queries.
- Proven ability to structure projects using Jira, documentation, and process frameworks.
- Excellent communication, pedagogical, and diplomatic skills for cross‑functional collaboration.
**Key Responsibilities**
- Evolve and maintain data lake ingestion pipelines, including reuse of existing jobs.
- Optimize pipeline performance, with a focus on Hive query efficiency.
- Develop complex data extraction routines using advanced SQL.
- Automate repetitive tasks through advanced Shell scripting and scheduling.
- Lead technical design, conduct code reviews, and guide interns or new hires.
- Provide Level‑2 support to internal users and troubleshoot platform issues.
- Reverse‑engineer legacy Hadoop architecture to align with current standards.
- Guarantee adherence to deadlines, maintainability, and platform operability.
- Collaborate with the Technical Direction and Data Governance teams.
**Required Skills**
- Python (PySpark), Spark, Hive, Impala, and SQL (expert level).
- Advanced Shell scripting on Linux, YARN log analysis.
- Performance tuning and job optimization.
- Familiarity with Cloudera Hadoop ecosystem, Jupyter, VBA (optional).
- Project structuring using Jira, documentation, and process management.
- Strong communication, teaching, and stakeholder management.
**Required Education & Certifications**
- Bachelor’s or Master’s degree in Computer Science, Data Engineering, or related field.
- Certifications in Hadoop/Cloudera, Spark, or other big‑data technologies are a plus.
---