Senior Data Engineer (Remote, Part-Time)
India
Welo Data - AI Services – Data Validation /
Remote
Welocalize is seeking a Senior Data Engineer with world-class Python expertise and a sharp eye for data quality, engineering rigor, and visualization fidelity.
In this role, you will create realistic datasets, write intuitive prompts, and develop high-quality “golden plots” to power insightful data tasks. You'll bridge technical precision with real-world plausibility—crafting examples that meet strict quality rubrics and correctness criteria.
Role Details
Location: Remote
Commitment: Part-time, flexible hours
Compensation: Competitive hourly rate
🔍 Key Responsibilities
- Design and build realistic toy/dummy datasets at varying complexity levels (simple, moderate, complex).
- Ensure datasets reflect real-world scenarios while remaining clean, reproducible, and well-structured (CSV format).
- Write concise, natural-language prompts (<40 words) tailored for business analysts.
- Ensure prompts are grammatically precise and align with project style (non-technical, not overly directive, no raw metric requests).
- Reproduce target visualizations with aesthetic and analytical fidelity.
- Write clean, reproducible Python code using libraries like pandas, matplotlib, seaborn, and plotly.
- Maintain high-quality code standards: well-commented, organized, and reproducible.
- Define correctness criteria tailored to open-ended insight prompts.
- Ensure criteria are flexible, accurate, and avoid assuming fixed solutions.
- Provide supporting documentation with expected values and metrics (e.g., means, correlations, test results).
- Rigorously check each example against quality rubrics (correctness, completeness, clarity, justification).
- Maintain a high standard of accuracy across data, prompts, and visualizations.
📊 Dataset Design & Curation
✏️ Prompt Engineering
📈 Golden Plot Creation
✅ Correctness Criteria & Documentation
🕵️ Quality Assurance
Required Qualifications
- 10+ years of professional experience in data engineering or applied data science.
- Expert-level proficiency in Python (data wrangling, visualization, statistical testing).
- Deep understanding of data modeling, prompt crafting, and reproducible workflows.
- Exceptional attention to detail and quality control.
- Proven success in high-impact technical environments.
Preferred Qualifications
- Experience with statistics, hypothesis testing, and storytelling with data.
- Background in producing training data, developer tools, or reproducible research.
- Strong writing skills for crafting precise prompts and correctness criteria.
If you're a data expert who thrives on quality, clarity, and impactful engineering, we’d love to hear from you.