Job Description:
At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. We do this by driving Responsible Growth and delivering for our clients, teammates, communities and shareholders every day.
Being a Great Place to Work and providing a culture of caring is core to how we drive Responsible Growth. We are intentional about fostering an inclusive workplace where every teammate has the opportunity to succeed, build a career and contribute to our shared success. This includes attracting and developing exceptional talent, recognizing and rewarding performance, and supporting our teammates’ physical, emotional, and financial wellness through affordable, competitive and flexible benefits.
We value the unique perspectives individuals bring from all backgrounds and career paths - whether shaped by military service, community college education, or a wide range of work and life experiences. These journeys foster resilience, leadership and innovation, strengthening our workforce and positively impacting the communities we serve.
Bank of America is committed to an in-office culture that supports collaboration, engagement, and career development. Our approach includes clear in-office expectations, while providing an appropriate level of flexibility based on role-specific responsibilities and business needs.
At Bank of America, you can build a successful career with opportunities to learn, grow, and make an impact. Join us!
We are seeking Senior Site Reliability Engineers (SREs) to design, build, and maintain Bank of America’s cutting-edge AWS Platform.
This role offers the opportunity to work with a wide range of technologies and to build a unique system-integration skillset that comes only from seamlessly integrating highly complex, disparate services in a hybrid cloud environment. You will also be part of a fun, smart, hardworking, and growing global team.
Our team fosters a high-quality engineering culture and a customer-focused mindset, and builds for scale. We’ll give you room to innovate and the opportunity to have a significant impact on the evolution of the current and next-generation cloud platforms for one of the largest financial institutions globally.
Responsibilities:
As part of this team, you will collaborate with a diverse set of developers, engineers, and architects to deliver infrastructure, policy, observability, and security as code. You will be empowered to eliminate operational toil and to identify and design platform improvements. You will be expected to define and measure service health using SLIs and SLOs, participate in a global support model, and help evolve our highly scalable enterprise platform.
Required Skills:
- 10 years of experience in SRE, software development, or infrastructure engineering.
- Extensive hands-on experience designing and operating highly automated, production-grade cloud platforms on a major cloud service provider (Amazon Web Services highly preferred).
- A deep understanding of Site Reliability Engineering principles and practices.
- Proven hands-on experience in creating complex infrastructure automation using Python, Java/JavaScript, Terraform, or Ansible.
- Strong experience with monitoring tools such as Grafana, Prometheus, and Dynatrace, as well as AWS native tools.
- Advanced proficiency in implementing a GitOps model with tools such as Terraform and CodePipeline.
- Strong, foundational understanding of compute and containerization technologies (such as EC2, ECS, EKS), Linux operating systems, networking, and CLIs, including shell scripting.
- A proven ability to work independently with minimal supervision within a global team with competing priorities.
- Excellent interpersonal and organizational communication skills are a must.
Desired Skills:
- Experience working with a complex controls environment and/or experience with implementing AWS tools and services like Control Tower, GuardDuty, Control Policies and Permission Boundaries.
- Experience working with a complex IAM infrastructure, including Active Directory, AWS Identity Center, Ping Identity, Okta, or other SSO solutions.
- Advanced knowledge of networking (firewalls, DNS, load balancing, proxies, etc.).
- Experience in implementing, monitoring, and maintaining Bedrock, SageMaker, or other AI platforms.
- Experience in implementing, monitoring, and maintaining highly scalable and resilient enterprise platforms on GCP, Azure, or another cloud service provider using native services related to compute, storage, networking, security, and observability.