Data Scientist
San Diego Community Power, San Diego, CA, United States
About the role:
San Diego Community Power (SDCP) is seeking a seasoned Data Scientist to join our growing team of data transformation experts who will be responsible for collecting, organizing, and mining data to build AI and data science initiatives to deliver clean energy solutions that help our local and global communities. A key priority of this role will be to lead efforts in identifying key data sources and development of statistical modeling and machine learning solutions that deliver impactful results for SDCP. The focus of this role is to deliver predictive and prescriptive algorithms and models using structured and unstructured datasets. The Data Scientist in this role is responsible for statistical analysis, deep learning modeling, data wrangling, mathematics algorithms, computer vision models, visualization of data and outcome, and validations of model performance. This role will collaborate with agency leaders and staff to understand AI, forecasting and predictive modeling needs.
This role will report to the Director of Data Analytics and IT Services.
Qualifications
- Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases.
- Experience developing new machine learning models, model optimization and deployment of modeling solutions.
- Experience performing trend analysis on internal and external data to develop business solutions.
- Strong analytic skills related to working with unstructured datasets.
- Build processes supporting data transformation, data structures, metadata, dependency and workload management.
- A self-starter with high bias for action.
- Experience in data gathering and model classification techniques.
- Experience with processing, cleansing and verifying the integrity of data for exploratory analysis.
- Minimum of five (5) years of professional experience in a Data Scientist role, preferably in the energy industry with a graduate degree or PhD in Engineering, Data Science, Statistics, or another quantitative field.
- Experience with developing data architecture and solution architecture.
- Experience with big data tools and technical architecture.
- Experience with AWS, Azure or Google Cloud services.
- Experience with Python, Sagemaker, Panda, Plotly, Pytorch, Flask, Tensorflow or Keras and similar ML frameworks.
- Knowledge of Computer Vision, Convolutional Neural Networks (CNNs), Deep Learning, Large Language Models, and Natural Language Processing.
- Knowledge of data ingestion and data transformation methodologies including batch, micro-batching and real-time data ingestion.
- Experience with GitHub or similar code repositories.
Responsibilities
- A key priority of this role will be to lead efforts in identifying key data sources and development of statistical modeling and machine learning solutions that deliver impactful results for SDCP.
- The focus of this role is to deliver predictive and prescriptive algorithms and models using structured and unstructured datasets.
- The Data Scientist in this role is responsible for statistical analysis, deep learning modeling, data wrangling, mathematics algorithms, computer vision models, visualization of data and outcome, and validations of model performance.
- This role will collaborate with agency leaders and staff to understand AI, forecasting and predictive modeling needs.
- This role will report to the Director of Data Analytics and IT Services.
- Serve as lead data scientist and assist in driving data science and AI initiatives for the enterprise.
- Collaborate with data analytics, IT and product teams to understand and document needs for predictive and prescriptive models.
- Design and develop innovative statistical and mathematical models to conduct exploratory data analysis.
- Research new modeling techniques and stay current with industry, open-source and academia.
- Develop feature selection, building and optimize classifier using machine learning methods.
- Develop energy load forecasting models using time-series data.
- Extending company’s data sources with third party sources for comprehensive and holistic information building.
- Conduct ad-hoc data analysis and presentations / telling the data story as needed.
- Create automated way to detect anomalies and monitor model performance.
- Collaborating with other internal teams, data analysts, and other stakeholders to understand and optimize how data can be leveraged to meet business needs.
- Performs other related duties and responsibilities as required.
Benefits
- Salary Range: The position salary range is: $124,984 to $136,197; with exact compensation to be determined by SDCP, depending upon experience.
- Insurance: SDCP offers group health benefits, including medical, vision, and dental insurance, for eligible FT employees.
- Also provided is a $100,000 Life & AD&D policy, STD and LTD coverage that is 100% paid by SDCP.
- Retirement: SDCP offers a 457(b) plan for employee contributions and contributes 10% of eligible compensation to the employee’s Money Purchase Plan.
- Paid Time Off: 11 holidays per year + paid winter holiday (between 12/24-12/31), 160 hours of accrued paid time off per year (increases with time in service), and 96 hours per year of accrued paid sick leave.
Company information
San Diego Community Power is a not-for-profit public agency bringing you cleaner energy at competitive rates. We provide reliable, affordable electricity to nearly 1 million customer accounts in the Cities of San Diego, Chula Vista, Encinitas, La Mesa, Imperial Beach and National City, as well as the unincorporated communities of San Diego County.
#J-18808-Ljbffr