Broadridge Financial Solutions Site Reliability Engineering Manager (JR1026356) in Newark, New Jersey
Broadridge Financial Solutions, Inc. (BR) (https://finance.yahoo.com/quote/br?ltr=1) , a $4 billion global Fintech leader and part of the S&P 500® Index, is a leading provider of investor communications and technology-driven solutions to banks, broker-dealers, asset and wealth managers and corporate issuers. At Broadridge, we do well by doing good. Our unique culture is guided by the Service-Profit Chain—the idea that success is mutual, directly connecting employee engagement, client satisfaction, and the creation of stockholder value. We enable better financial lives by powering investing, governance, and communications for our clients, their customers, and the financial services industry.
Are you seeking a position within a growing company? Broadridge is hiring! Our mission is to attract, develop and retain outstanding talent. Being a place where exceptionally driven and hardworking people want to work is how we deliver award-winning services to our customers and ultimately build customer value. We’re seeking an SRE Manager to join our Site Reliability team. You will have the opportunity to provide leadership defining and refining engineering processes.
Implement GTO SRE strategy across a given Portfolio of products and associated SRE teams within a business segment.
Manage SRE teams so that they
- Have effective capabilities for ensuring production uptime and stability as well as the observability, reliability, availability, performance, capacity planning and operational support for the products across GTO.
Have effective processes for continuous improvements to improve Service Level Objectives (SLO) and mean time to identification (MTTI), mean time to resolution (MTTR), and mean time to failure (MTTF).
Can effectively engage in incident management and have well defined procedures for the identification of relationships between processes and events, and their root cause.
Focus on automation; in the context of self-healing, auto-remediation, removing manual toil, orchestration tooling and infrastructure-as-code patterns.
Ensure that the systems can withstand 'chaos engineering' practices and can fail gracefully when services are degraded
Ensure the means exist to quickly recover a degraded service (instrumentation, runbooks, tooling etc).
Ensure adequate instrumentation and alerting exists to spot leading indicators of an impending incident in the system; as well as in systems on which the platform depends.
Provide leadership defining and refining engineering processes as the teams grow. Motivate, lead and develop a team of talented engineers.
Drive SRE education across the team to improve quality and reliability
Directly collaborate with Portfolio technology and product stakeholders to understand their strategies and needs and incorporate them into SRE backlog. Partner with them to ensure effective communication regarding service reliability, performance and superior customer experience
Provide best practice SRE consideration to Portfolio product Architecture and Application Development teams so that stability and reliability are incorporated into new solutions
More than 15 years of relevant working experience with a strong technical hands-on experience
Strong experience with automation and orchestration of applications and infrastructure components
Constant improvement approach
Excellent knowledge in distributed architecture, Cloud, microservices, SOA, IaaS and PaaS as related to design patterns
Ability to identify potential design issues and present valid solutions/options during the design phases
Experience in Agile and Test-Driven Development (TDD) methodologies
Experience in leading SRE teams and/or DevOps functions or similar
Understanding what it takes to support applications and its related infrastructure in a production environment (Service Level Agreements)
Experience growing and building highly effective teams.
Experience collaborating across organizational boundaries, forming alliances with other members of the Portfolio management leadership team and building bridges that support functional as well as company goals.
Ability to identify trends and promote solutions that solve challenges efficiently across the organization
Highly-collaborate who can build strong relationships at all levels of the technology and business organizations
SRE Sprint planning and ability to prioritize tasks to meet the sprint
Working through the definition, design, release and run cycle of software products to markets
Experience with DevOps, ITIL, Cloud Services, IT Infrastructure and Operations, including environment stand-up, server builds, firewalls, security and regulatory compliance.
Experience of any object-oriented language
Proficiency working in Unix/Linux environments.
Experience with IBM MQ, Kafka, Postgres
Experience with Amazon AWS solutions capabilities such as EC2, EBS, RDS, S3, Cloud Formation, Dynamo DB, Route 53, IAM, ELB, CloudWatch, Lambda, Kinesis etc.
Experience of Logging, Monitoring and Alerting framework for hybrid cloud or third-party services using AppDynamics, Splunk, Data Dog and CA APM.
Experience with Atlassian toolset JIRA/Confluence and agile development practices
Experience with tools such as Jenkins and Ansible, GIT, Maven, Nexus, Chef, Docker, Terraform, Kubernetes, Pivotal Cloud Foundry, Concourse
Broadridge is an equal opportunity employer and makes employment decisions without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability, age or any other protected status. "Everyone Benefits from Diversity & Inclusion. Diverse & Inclusive Teams Drive Growth." US applicants: Click here (https://www.dol.gov/ofccp/regs/compliance/posters/ofccpost.htm) to view the "EEO is the Law" poster. If you are a qualified individual with a disability or a disabled veteran, you may request a reasonable accommodation in the event you are unable or limited in your ability to use or access the Companys career webpage as a result of your disability. You may request a reasonable accommodation(s) by calling 888-237-7769 or by sending an email to BRcareers@broadridge.com