About this role
Job Details
We're hiring a Site Reliability Engineer II to be a part of our team to build the technology infrastructure for Openly's insurance platform. You will play a crucial role in building, testing, and maintaining the infrastructure and the overall technology ecosystem that powers our insurance products and customer experiences.
Key Responsibilities
Build internal tooling to help other engineers and the rest of the company understand and operate our system
Design and implement security best practices for our team and infrastructure
Reduce toil through automation, including building and maintaining CI/CD infrastructure
Build infrastructure as code using declarative provisioning tools
Develop high signal-to-noise ratio monitoring and alerting policies and technology to help us meet our SLOs
Lead incident response and postmortems
Contribute to important architectural and operational decisions like microservices vs. monoliths, deployment techniques, technologies, policies, etc.
Our stack
Backend: Go & Postgresql
Frontend: Browser-based, VueJS, Webpack, Nuxt &, Tailwind
Research/Data Science: R, ArcGIS, Jupyter Notebooks, & Python
Data: GCP GCS, BigQuery, Composer/Airflow, Cloud Functions, Postgres, SQL, Python, Go, Aiven Debezium and Kafka, Fivetran
Infrastructure: Google Cloud, specifically Cloud Run, Kubernetes, Pub/Sub, BigQuery, and CloudSQL, managed with Terraform. We use GitHub for code hosting, DataDog for monitoring, and CircleCI for running our CI/CD pipelines.
Remote work tools: Slack, Zoom, Donut
Requirements
2+ years of professional/production experience developing and using infrastructure automation tools and techniques
Proven track record of creating improvements in business-critical systems around stability, performance, and scalability
Demonstrated ability to deliver complete systems from start to finish in a reasonable time frame
Understands the consequences of running software in production and are willing to share your knowledge with the rest of the team
Ability to explain complex technical challenges to non-technical audiences
Strong scripting skills in one or more of the following: Python, Go
Experience working with Infrastructure as Code (IaC) tooling, preferably Terraform
