Senior Systems Operations Engineer
Company: Wells Fargo
Posted on: May 28, 2023
About this role:
Wells Fargo is seeking a Senior Site Reliability Engineer who thrives on solving complex problems through innovation and on driving change at scale in a diverse environment. You will be part of a focused team of Site Reliability Engineers (SREs) introducing and advancing the SRE discipline across multiple applications and customer journeys on the Card Services Platform. The team will drive technology transformation and adoption of SRE-aligned enterprise capabilities and products, launch new tooling, automate away complex issues, and integrate with the latest technology. Site Reliability Engineers leverage their experience as software and systems engineers to ensure that applications onboarded to SRE are available, have full-stack observability, are continuously tested, and are integrated with CI/CD. They introduce continuous improvement through code and automation, provide operational insight through analytics, and work with application teams to ensure the products and services we provide are always on.
This Senior Site Reliability Engineer will be responsible for the following:
Help drive Site Reliability Engineering capabilities at Wells Fargo Card Services, igniting the practice, principles, and culture and leading by example. Assist in training skilled engineers by growing the practice within Card Services and partnering with peer platform-embedded SRE teams.
Leverage enterprise capabilities, tools, and innovation to improve availability in a complex ecosystem by evolving observability, monitoring, logging, synthetic monitoring, and chaos engineering.
Evolve our environment by introducing self-healing and autonomic capabilities that solve complex operational and systemic issues with precision, including building and training models and automating cognitive processes to improve the availability of the products we provide to customers.
Automate key SRE metrics and IT Service Operations processes, including customer impact, % availability of critical business flows, SLO/SLI adherence, and error budget. Automate the incident process for IT Service Operations through data, integrating with unified communications and alerting/notification systems.
Share support responsibilities for critical applications and customer journeys onboarded to SRE, including remediating issues through Agile, conducting blameless postmortems and root cause analysis, and introducing continuous improvement that solves problems once and for all, with the goal of no repeats.
In this role, you will:
Lead or participate in managing all installed systems and infrastructure within the Systems Operations functional area
Contribute to increasing system efficiencies and reducing the human intervention time on related tasks
Review and analyze moderately complex operational support systems, application software, and system management tools to ensure the highest levels of systems and infrastructure availability
Work with vendors and other technical personnel for problem resolution
Lead the team to meet technical deliverables while leveraging a solid understanding of technical process controls or standards
Collaborate with vendors and other technical personnel to resolve technical issues and achieve highest levels of systems and infrastructure availability
Required Qualifications, US:
4+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
3+ years of experience designing and managing Splunk dashboards, reports, lookup tables, and summary indexes
2+ years of database logging and monitoring concepts experience
4+ years of application production support experience
3+ years with one or more Agile tools used for tracking user stories or backlogs, such as Confluence or Jira
Experienced with Site Reliability Engineering (SRE)
2+ years of experience with application performance monitoring and optimization using BlazeMeter, JMeter, Splunk, and AppDynamics
Experience with and understanding of AIOps and related tools such as Moogsoft or BigPanda
Experience with one or more automation tools such as Ansible
Experience with container technologies: Kubernetes, Docker, PKS
Flexibility to provide 24/7 support on a rotation basis as needed
Ability to work additional hours outside regular business hours
We Value Diversity
At Wells Fargo, we believe in diversity, equity and inclusion in the workplace; accordingly, we welcome applications for employment from all qualified candidates, regardless of race, color, gender, national origin, religion, age, sexual orientation, gender identity, gender expression, genetic information, individuals with disabilities, pregnancy, marital status, status as a protected veteran or any other status protected by applicable law.
Employees support our focus on building strong customer relationships balanced with a strong risk mitigating and compliance-driven culture which firmly establishes those disciplines as critical to the success of our customers and company. They are accountable for execution of all applicable risk programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes effectively following and adhering to applicable Wells Fargo policies and procedures, appropriately fulfilling risk and compliance obligations, timely and effective escalation and remediation of issues, and making sound risk decisions. There is emphasis on proactive monitoring, governance, risk identification and escalation, as well as making sound risk decisions commensurate with the business unit's risk appetite and all risk and compliance program requirements.
Candidates applying to job openings posted in US: All qualified applicants will receive consideration for employment without regard to race, color, religion, age, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
Candidates applying to job openings posted in Canada: Applications for employment are encouraged from all qualified candidates, including women, persons with disabilities, aboriginal peoples and visible minorities. Accommodation for applicants with disabilities is available upon request in connection with the recruitment process.
Drug and Alcohol Policy
Wells Fargo maintains a drug-free workplace. Please see our Drug and Alcohol Policy to learn more.
Company: WELLS FARGO BANK
Req Number: R-206458
Updated: May 22, 2023
Keywords: Wells Fargo, Chandler, Senior Systems Operations Engineer, Other, Chandler, Arizona