Site Reliability Engineer

over 1 year ago
Full time role
Louisville, CO, US

Job Description

Site Reliability Engineer 

AMP Robotics is a pioneer and industry leader in artificial intelligence (AI), robotics, and infrastructure for the waste and recycling industry. We apply AI-powered automation to economically and sustainably improve our global recycling system.

AMP is headquartered in Louisville, Colorado, and the Denver Post and BuiltIn Colorado have recognized it as one of the state's top workplaces. AMP also has operations and career opportunities outside of Atlanta and Cleveland, as well as in Europe. We’re fostering an environment where passionate individuals can grow and create impact. We seek unconventional thinkers to join our mission to enable a world without waste; at AMP, your contributions have meaning and can spur change. With backing from top-tier investors and national recognition including North American Cleantech Company of the Year and Forbes’ most promising AI companies, we’re always seeking ways to better our operations, raise the bar on innovation, and collaborate and improve in what we do. Learn more at AMPRobotics.com.

AMP Robotics is hiring a Site Reliability Engineer reporting to the Site Reliability Engineering Manager. The role focuses on turning raw field data into actionable and transparent intelligence, allowing efficient scaling and improved reliability of AMP fleet devices and facilities. As a Site Reliability Engineer, you will build automated tooling to proactively monitor device health and technical process health, and you will lead efforts in the commissioning and startup of new AMP facilities.

The Site Reliability Engineering team will work closely with other engineering groups (DevOps, Software Engineering, Facility Ops, QA Engineering, etc.) to define acceptable error rates and a common performance language. As an Ops-facing and reliability-focused group, Site Reliability Engineers will focus heavily on reducing product downtime, predictive modeling and proactive alerts for device-level failure, standardized root cause analysis, and process engineering best practices.

Site Reliability Engineers function as the primary liaison between Ops and Sustaining Engineering efforts, increasing the interdisciplinary knowledge of product designers, sales teams, project managers, service teams, and production groups with respect to how AMP devices and software actually function in the wild. The Site Reliability Engineer will be responsible for answering the question “Is the fleet operating within defined performance metrics, without interruption… and can we prove it?” The Site Reliability Engineer will work closely with customers and internal teams to operate AMP Robotics’ fleet of devices at peak performance.

As a member of the Site Reliability Engineering team, you will:

  • Serve as Tier 2 escalation point for AMP Field Service Technicians and customers.
  • Work with Software and DevOps groups to automate and execute software rollouts to devices operating in production facilities around the world.
    • Perform before-and-after rollout analysis, validating and verifying application-specific performance parity or improvement.
    • Minimize eyes-on-glass time during fleet-wide rollouts.
    • Sustain AMP’s competitive advantage by increasing Software Team feature velocity: become the most efficient portion of the product update process.
  • Develop internally facing proactive alerting, system state dashboards, and historic reporting capabilities to ensure devices achieve maximum uptime and operate as expected.
    • Create and maintain a proactive alerts library - fully defining edge cases and using historic data to develop appropriate severity and priority thresholds.
    • Automate injection of alerts to appropriate reporting platforms (Salesforce, Slack, Jupyter notebooks, etc.) - allowing appropriate teams to act with urgency to resolve issues.
    • Manage real-time dashboards for company-wide access to device specific performance metrics.
    • Work with internal teams and directly with customers to ensure all devices are operating at peak performance
  • Act as the connection point between Production and Engineering teams, ensuring production personnel can effectively deploy software onto AMP devices.
    • Automate warehouse commissioning processes where possible
    • Maintain documentation associated with device level commissioning
    • Provide regular reports to production groups associated with commissioning deviations and commissioning process areas of improvement
  • Provide standardized new application and new product feedback
    • Join the Asset Deployment Team for boots-on-the-ground installations of new applications or new products, documenting anomalies, product improvements, or likely points of failure.
    • Act as a product team resource early in the design process - participating in design reviews and building reliability monitoring processes in parallel with new product design.
    • Perform auditing of startup and calibration parameters - providing the Asset Deployment Team with automated feedback about the correctness of new installations.
  • Assist in efforts to commission and start up new AMP facilities across the US.
    • Drive lessons learned from each startup back to design engineering and asset deployment groups.
  • Examples of associated KPIs
    • Cost to Serve
    • Customer NPS Score
    • Rollout Execution Metrics
    • ECO execution speed

The successful candidate will have the following skills and experience:

Required:

  • 2+ years experience programming with a scripting language like Python (hobby level projects considered).
  • 2+ years of experience with Linux system administration
  • Use of multiple database technologies (e.g., MySQL, Postgres, Timescale, InfluxDB)
  • Networking exposure - TCP/IP networking (routing & switching, VPNs, managing VPCs, running and interpreting packet captures)
  • Automated build program experience (Ansible Tower, Jenkins, GitLab CI, etc.)
  • Good working familiarity with Docker and Docker Compose
  • Equally comfortable behind a keyboard or holding a wrench

Preferred: 

  • Experience in the GCP ecosystem. 
  • 3+ years experience with industrial controls & automation and/or process engineering
  • Examples of technical writing
  • Grafana Experience
  • Experience with a wide variety of both batch and continuous process controls
  • Interest in/experience with Machine Learning/Artificial Intelligence and/or robotics
  • Ability to read and/or write any adjacent modern coding language
  • Facility operation experience

Education:

  • Bachelor’s Degree in any engineering discipline

Experience: 

  • 2+ years experience in Process Engineering, Controls and Automation Engineering, DevOps, TechOps, Sustaining Engineering, Reliability Engineering, and/or Site Reliability Engineering

Working Conditions/Physical Demands:

The physical demands described here are representative of those that must be met by an employee to successfully perform the essential functions of this job.  Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.  

  • Prolonged periods of sitting at a desk and working on a computer.
  • Must be able to lift up to 15 pounds at times.

Working Location(s):

  • Louisville, Colorado

Travel Requirements:

  • Up to 40%

Affirmative Action/EEO Statement: 

AMP Robotics is an equal opportunity employer. In order to provide equal employment and advancement opportunities to all individuals, employment decisions at the Company will be based on job openings, merit, qualifications, and abilities as required by the position. The Company does not discriminate, and does not permit its employees to discriminate against other employees, applicants, customers, or independent contractors because of:

  • Race
  • Color
  • Religion
  • Sex
  • Sexual orientation (including gender identity or expression, including a person's orientation toward heterosexuality, homosexuality, bisexuality, or transgender status, or the Company’s perception thereof)
  • Pregnancy, childbirth, and related conditions
  • Marital status
  • National origin
  • Citizenship
  • Military or veteran status
  • Ancestry
  • Age (40 or over)
  • Disability (including genetic information)
  • Or any other consideration made unlawful by applicable laws.

Equal employment opportunity will be extended to all persons in all aspects of the employer-employee relationship, including recruitment, hiring, upgrading, training, promotion, transfer, compensation, benefits, discipline, layoff, recall, and termination.

Other duties:

Please note this job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee for this job. Duties, responsibilities, and activities may change at any time with or without notice.

We recognize that there is more to work than the day-to-day responsibilities. In addition to a collaborative, high-performing team environment, we’re pleased to offer competitive base salaries; medical, dental and vision insurance; a 401(k) plan; paid time off and sick time; flexible work hours; and the opportunity to quickly accelerate your learning and growth.

Salary & compensation information: $75,000-$95,000

Benefits Information:

  • Medical - The company covers 85% to 100% of the premium for Cigna healthcare plans, depending on the plan selected. Employees pay the difference in premium if they select a more expensive plan. The company covers up to 75% of the premium for dependents.
  • Dental, Vision, Short and Long Term Disability.
  • 401(k) retirement plan (non-matching).
  • FTO - Flexible Time Off 
  • Nine (9) paid holidays 
  • Equity 
