Logo
ApTask

Machine Learning Engineer

ApTask, Cupertino, California, United States, 95014


Salary- $135K + BenefitsJob Description:Responsibilities:

Hands-on experience in Azure cloud technology and Terraform.Design and implement monitoring and alerting strategies to enforce application SLAs.Develop, test, and debug automated tasks (Apps, Systems, Infrastructure).Troubleshoot priority incidents, facilitate blameless post-mortems.Work with development teams throughout the software life cycle ensuring sustainable software releases.Perform analytics on previous incidents and usage patterns to better predict issues and take proactive actions.Define, drive adoption and enforcement of service level objectives at both service and experience levels.Influence, design and create new architectures, standards and methods for large-scale enterprise systems.Build and drive adoption for greater self-healing and resiliency patterns.Lead and participate in performance tests; identify bottlenecks, opportunities for optimization, and capacity demands.Experience in managing and scaling distributed systems in a public, private, or hybrid cloud environment.Advise design reviews, operational reviews, and the deployment of highly available infrastructure.Make monitoring and alerting meaningful to support our uptime goals.Operate and maintain infrastructure in multiple public clouds, especially Azure Cloud.Experience with deploying, supporting, and monitoring new and existing services, platforms, and application stacks.Experience with scale testing, disaster recovery, and capacity planning.Cloud operations experience in a large-scale 24x7 production environment.Manage, innovate and create programs, new software, analytics that drive improvements to the availability, scalability, latency, and efficiency of Client Applications and services.Essential Functions:

Designs and writes complex code in several languages relevant to our existing product stack, with a focus on automation.Configures, tunes, maintains and installs applications systems and validates system functionality.Monitors and fine-tunes applications system to achieve optimum performance levels and works with hardware teams to resolve issues with hardware and software.Develops and maintains department's knowledge database containing enterprise issues and possible resolutions.Develops models of task problem domain for which a system will be designed or built.Uses models, hypotheses, and cognitive analysis techniques to elicit real problem-solving knowledge from the experts.Mediates between the expert and knowledge base; encodes for the knowledge base.Acts as subject matter expert for difficult or complex application problems requiring interpretation of AI tools and principles.Researches and prepares reports and studies on various aspects of knowledge acquisition, modeling, management, and presentation.Develops and maintains processes, procedures, models, and templates for collecting and organizing knowledge into specialized knowledge representation programs.Acts as vendor liaison for products and services to support development tools.Maintains the definition, documentation, training, testing, and activation of Disaster Recovery/Business Continuity plans.

#J-18808-Ljbffr