Senior Back-End / Data Engineer - Innovation team
Senior Back-End / Data Engineer – Innovation team
About Us
Power Factors is accelerating the green energy transition by providing advanced analytics and AI insights to operators of renewable energy assets. Our SaaS platforms are used to manage over 250 GW of wind, solar, hydro, and energy storage projects globally. By driving down operational costs and increasing revenue, we are tackling one of the world's most important challenges: making renewable energy the world's leading source of power. Our vision is to create a sustainable world powered by renewable energy. Our mission is to fight climate change with code.
About the Role
We are looking for a Senior Backend Engineer to join the Innovation team— This role is the connective tissue of the . You will own the data foundation that makes state-of-the-art model training possible, the inference service that makes model outputs usable, and the platform integration that puts those outputs in front of pilot customers. From production ETL through to shadow-mode validation pipelines, you will be the engineer who keeps every track moving. The role is based in Montreal and is critical-path from day one.
About You
You are a senior backend engineer with serious data engineering chops and a track record of shipping production ML-adjacent systems. You have built pipelines that don't just move data — they guarantee its quality, version it for reproducibility, and hold up under real-world operational chaos. You understand that a model is only as good as the data it trains on, and that an inference service is only as useful as the product it integrates into. You are energised by breadth and ownership, and you are comfortable carrying a heavy load on a small team where your decisions have direct programme-level consequences.
Required Qualifications
- 6+ years of backend and data engineering experience, with a proven track record of shipping production systems
- Production-grade ETL/ELT pipeline design at scale: idempotency, retry logic, backfill jobs, incremental loading, and cost-controlled warehouse compute
- Schema design and data modelling across heterogeneous sources — experience reconciling signals from disparate systems into a canonical, queryable format
- Data quality engineering: automated quality gates (sparsity, flatline detection, outlier flagging, freshness checks), alerting pipelines, and dataset versioning for ML reproducibility
- API design and development: RESTful inference services with contract testing, latency and throughput budgeting, and structured observability (logs, metrics, traces)
- Experience integrating ML model outputs into SaaS product surfaces: auth and authorisation, customer isolation, feature flag management
- Cloud infrastructure proficiency (AWS preferred), containerisation (Docker, Kubernetes), and CI/CD pipeline ownership
- Python and SQL as core tools; hands-on experience with modern warehouse technologies (Snowflake, BigQuery, or Databricks)
- Pipeline orchestration with Airflow, Prefect, Dagster, or equivalent
- Excellent communication skills and fluency in English (written and verbal)
Beneficial Qualifications
- Experience with time-series, IoT, or industrial sensor data (SCADA systems, irregular sampling, high-missingness signals) — a significant advantage
- Familiarity with streaming data platforms (Kafka, Kinesis, or Pub/Sub) for real-time or near-real-time ingestion
- Experience designing and managing feature stores for ML training and serving
- Renewable energy domain knowledge: understanding of wind and solar asset operations and the data they produce
- Experience standing up shadow-mode or A/B comparison pipelines for ML systems — running live inference in parallel with an existing path and logging predictions against actuals
- Multi-tenancy and platform integration experience in B2B SaaS products
- Knowledge of data lake architectures and open table formats (Delta Lake, Iceberg, Parquet)
- Familiarity with MLOps practices: model registry conventions, retraining triggers, drift monitoring
Your Responsibilities
Data Foundation
- Design and build the production ETL pipeline from source systems to warehouse and feature store at fleet scale, covering thousands of wind and PV sites across multiple OEMs
- Own canonical signal schema design across wind and PV asset classes and OEMs — the deepest technical unknown in the programme and the foundation everything else depends on
- Implement automated data quality gates: missingness, flatline detection, outlier flagging, and freshness checks, with alerting that generates tickets automatically
- Implement dataset versioning sufficient to reproduce every trained model from scratch
- Build and maintain backfill jobs, idempotency guarantees, and retry logic that survive mid-run failure without duplicating data
- Govern storage and compute costs on the warehouse from day one
Inference Service & API
- Build the batch and on-demand inference API with contract tests, sized for fleet-wide daily runs
- Establish latency and throughput baselines; own the cold-start and model-loading strategy
- Instrument the service with structured logs and metrics from the outset
Platform Integration
- Integrate forecasts into the Power Factors product platform: auth and authorisation with customer isolation, observability hooked into the existing stack, and feature flags per customer and per site
- Build and maintain the shadow validation pipeline: run live inference in parallel with the existing forecast path, log predictions and actuals, and produce weekly validation reports broken down by asset class, OEM, and region
- Support the pilot customer rollout: enable the product for friendly customers behind flags, own incoming data and integration tickets during the pilot window
Collaboration
- Work closely with the ML Engineer to align on data quality requirements, feature store interfaces, and the handoff between the data platform and training pipeline
- Partner with the Tech Lead during platform integration to ensure a clean, maintainable integration surface
- Contribute to architectural decisions across the programme and document data flows, schemas, and pipeline runbooks to a standard that supports the broader team
Why Join Us
- A humble cause with a clear purpose – you will help us fight climate change with code every day at work.
- Own the full data and serving stack for a greenfield foundation model programme
- Work with passionate experts – join a team of top talents in renewable energy and data science.
- Friendly and uplifting atmosphere – we value kindness, respect, and collaboration.
- Based in Montreal, with hybrid flexibility.
We are an Equal Opportunity Employer
Power Factors is committed to building a diverse and inclusive team. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, gender identity, national origin, disability, or veteran status.