We are looking for a Senior DevOps Engineer to help build risk management solutions as part of our Cybersecurity Risk Platform. You will work closely with our Platform Team, other members of the Engineering Operations Teams, and customers to efficiently deploy scalable and reliable software systems. Members of our team enjoy a collaborative work environment focused on delivering value to large-scale enterprise customers.
In this role, you will assist the Development and Quality Assurance Teams in optimizing our SDLC, as well as work with Cloud Operations and Site Reliability Engineering Teams in the deployment, monitoring, and ongoing management of customer systems. You must love efficiently delivering, managing, and optimizing reliable cloud infrastructure and services by using DevOps principles applied to cloud-based architectures.
The ideal candidate has a background in deploying and managing the operations of large-scale SaaS software solutions and in improving developer productivity. You will need to build and manage CI/CD pipelines, use IaC tooling to deploy infrastructure, use monitoring tools to understand the operating environment, and respond to service interruptions. We are seeking an engineer who is passionate about delivering high-quality solutions used by some of the world's largest enterprises. The position reports to a Manager of Engineering Operations or above.
If you are looking to build your career with an exceptional team, be part of building something great, and make an impact, you may have found it here!
WHAT YOU WILL DO
Collaborate with the development teams on how we develop, test, and deploy the platforms
Maintain automated CI/CD pipelines that remove inefficiencies, toil, risk, and waste (cost) from our build and deployment processes
Automate mundane tasks so that all of the engineering teams can focus more on strategic development and driving business value
Work with developers to build highly observable systems that proactively report on system performance and reliability
Resolve critical customer issues to improve customer satisfaction and renewal rates
Work with the security and compliance teams to identify and appropriately mitigate risks
Participate as a First Responder in on-call, incident response, and incident management
Champion processes that support team-led work planning and value delivery
Participate in planning sessions that ensure objectives are well-understood so that standards and metrics can be established
Assist in building a highly capable team based on great talent identification and recruiting
Help ensure that the engineering team is happy, prolific, and autonomous
Maximize the productivity of our technologies by assisting in the development of technical documentation
Participate in incident postmortems to analyze the root causes of incidents and assess responses
WHAT YOU'LL NEED
5+ years of experience with Linux systems administration and scripting languages such as JavaScript, Python, and Bash
5+ years of experience designing and maintaining highly available, secure cloud infrastructure, including application/web firewalls, routing, VPCs, load balancers, auto-scaling, IDS, etc.
3+ years of experience with IaC & configuration management tools like Terraform and Ansible
3+ years of experience with monitoring software such as Datadog, Elastic Stack, or New Relic, including implementation of observability, monitoring, and reporting solutions
3+ years of experience building CI/CD pipelines using tools like GitHub, Bitbucket, Jenkins, etc.
Experience maintaining cloud platforms that comply with industry standards and best practices for security and privacy (e.g., SOC 2, PCI DSS, HIPAA, GDPR)
Experience with containers and orchestration via Docker and Kubernetes in a public cloud environment
Excellent verbal and written communication skills
BS in Computer Science, Computer Engineering, Electrical Engineering, or related discipline
AND IDEALLY
Experience developing applications that leverage AI technologies (e.g., ML, Generative AI, NLP)
Experience working with Neo4j, Yugabyte, Spring Boot, Kubernetes, GCP
Experience developing cybersecurity or IT systems management applications
Experience as a hands-on software engineer
Master's or Advanced Degree in Computer Science, Engineering, or related discipline
Job Types: Full-time, Permanent
Pay: $80,000.00-$90,000.00 per year
Work Location: Remote