Thrivent Digital Transformation
At Thrivent, we are dedicated to a digital transformation that prioritizes delivering modern, innovative experiences for our clients, financial advisors, and employees. Our focus includes investing in data and technology, utilizing DevOps practices, and fostering an engineering culture of empowered technical experts.
Our technologists engage in various areas such as cloud native development, digital architecture and integration, automation, cloud data platforms, artificial intelligence, and machine learning. We also maximize platforms like Salesforce, AWS, Microsoft, and other SAAS platforms.
Senior Observability Engineer Role
As a Senior Observability Engineer, you will be a technical expert in Observability and Site Reliability Engineering. Your responsibilities will include ensuring the reliability and performance of software systems, providing expertise in observability engineering, and supporting the growth and mentorship of others in observability practices. The role involves implementing, maintaining, and consulting on observability and monitoring platforms to meet the needs of internal stakeholders.
Duties & Responsibilities:
- Develop and enhance instrumentation of metrics, logs, and traces for observing the health and availability of services.
- Proactively observe systems, networks, and applications to improve stability, security, efficiency, and scalability.
- Participate in rotating on-call incident response on weekdays and weekends.
- Improve operational efficiencies through automation, scripting, AI, and integrations.
- Define best practices for making systems and services measurable and collaborate with teams for implementation.
- Collect, aggregate, and visualize metrics for actionable insights.
- Participate at a technical level in design and code reviews, taking a hands-on role in strategic technical initiatives.
- Partner with Leadership, Architects, Development, and operations teams to ensure product success.
- Evolve the observability platform with thoughtful and strategic leadership.
- Create awareness and implement SRE practices across the enterprise.
Required Job Qualifications:
- Bachelor’s degree in computer science or equivalent work experience.
- 7+ years of experience in engineering environments.
- Sound knowledge of Observability technologies and SRE best practices.
- Sound knowledge of systems design concepts for security and stability.
- Experience in agile and DevOps environments to establish technical standards and practices.
Preferred:
- Understanding of CI tools, primarily Git, and GIT-based version control systems.
- Experience with automation tools like Ansible, Terraform, and GitHub Actions.
- Knowledge of containerization tools and platforms, primarily Kubernetes & Fargate.
- Linux skills, including shell scripting.
- General knowledge of AWS, including EC2, ECS, S3, Lambda, CloudFormation, API gateway, VPC creation, load balancers, auto-scaling groups, CloudWatch Logging, CloudFront, app server configuration, and debugging skills.
- Strong operational experience in a Linux environment.
- Exceptional time management skills and the ability to manage shifting priorities in a fast-paced environment.
- Knowledge and experience with CI/CD tool sets.
Thrivent provides Equal Employment Opportunity (EEO) without regard to race, religion, color, sex, gender identity, sexual orientation, pregnancy, national origin, age, disability, marital status, citizenship status, military or veteran status, genetic information, or any other status protected by applicable local, state, or federal law. This policy applies to all employees and job applicants.
Thrivent is committed to providing reasonable accommodation to individuals with disabilities. If you need a reasonable accommodation, please let us know by sending an email to [email protected] or call 800-847-4836 and request Human Resources.
Apply Now