
C2C requirements
Senior Data Analyst/Engineer (Machine Learning) :: Con Edison :: New York
Databricks Data Engineer certification is MOST IMPORTANT
Contract
Job description
Required Certifications: Databricks Data Engineer certification is MOST IMPORTANT but can be completed post-onboarding. Google Cloud certification is also acceptable post-hire.
SQL: THIS IS HER TOP TOP MUST HAVE!! Must be able to write and execute SQL queries independently. Strong hands-on ability is required, including data extraction and basic to intermediate transformations.
PySpark (Highest Priority): Core requirement of the role. Candidates must have strong hands-on experience building data pipelines and performing large-scale data processing.
Python / Pandas: Required for data cleaning, transformation, and analysis.
Machine Learning (20% of role): Candidates must already have hands-on ML experience (classification, regression, clustering, model evaluation). No training will be provided on ML concepts.
Data Analysis Focus: Majority of the role involves SQL-based data extraction and PySpark/Pandas-driven analysis and insights
ML + Analytics Integration: Candidates should understand end-to-end workflows combining SQL, PySpark, and ML.
Team Structure: Approximately 10 Data Engineers and 6 Data Scientists within the team.
Please ensure candidates are screened heavily for hands-on PySpark, PANDA, SQL coding ability, and real ML implementation experience before submission.
To apply for this job email your details to imran.ansari@anviktek.com