Senior Systems Operations Engineer
Company: Wells Fargo
Location: Chandler
Posted on: May 28, 2023
|
|
Job Description:
About this role:
Wells Fargo is seeking a Senior Site Reliability Engineer who
enjoys and thrives on solving complex problems through innovation
impacting change at scale in a diverse environment. You will
participate as part of focused team of Site Reliability Engineers
(SREs) introducing and advancing SRE discipline across multiple
applications and customer journeys across the Card Services
Platform. The team will drive technology transformation and
adoption of SRE aligned enterprise capabilities and products,
launch new tooling enablement, automate away complex issues and
integrate with the latest technology. Site Reliability Engineers
leverage their experience as software and systems engineers to
ensure applications onboarded to SRE are available, have full stack
observability, introduce continuous improvement through code and
automation, provide operational insight through analytics,
continuously test, are integrated with CI/D and work with
application teams to ensure products and service we provide are
always on.
This Senior Site Reliability Engineer will be responsible for the
following:
Help drive Site Reliability Engineering capabilities at Wells Fargo
Card Services igniting the practice, principles, and culture
leading by example. Assist in training skilled engineers by growing
the practice within Card Services and partnering with peer platform
embedded SRE teams.
Leverage enterprise capabilities, tools, and innovation improving
availability in a complex ecosystem by evolving observability,
monitoring, logging, synthetic monitoring and chaos
engineering.
Evolve our environment introducing self-healing and autonomic
capabilities solving for complex operational and systemic issues
with precision including building and training models, automating
cognitive processes to improve availability of products we provide
to customers
Automate key SRE metrics and IT Service Operations processes
including customer impact, % availability of critical business
flows, SLO/SLI adherence, error budget, automate incident process
for IT Service Operations through data integrating with unified
communications, and alerting/notification systems.
Share support responsibilities for critical applications and
customer journeys onboarded to SRE including remediation of issues
through Agile, conduct blameless post mortems, root cause analysis
and introduce continuous improvement solving problems once and for
all with the goal of no repeats.
In this role, you will:
Lead or participate in managing all installed systems and
infrastructure within the Systems Operations functional area
Contribute in increasing system efficiencies and lowering the human
intervention time on related tasks
Review and analyze moderately complex operational support systems,
application software, and system management tools to ensure the
highest levels of systems and infrastructure availability
Work with vendors and other technical personnel for problem
resolution
Lead team to meet technical deliverables while leveraging solid
understanding of technical process controls or standards
Collaborate with vendors and other technical personnel to resolve
technical issues and achieve highest levels of systems and
infrastructure availability
Required Qualifications, US:
4+ years of Systems Engineering, Technology Architecture
experience, or equivalent demonstrated through one or a combination
of the following: work experience, training, military experience,
education
3+ years of experience designing and managing Splunk Dashboards,
reports, lookup tables, and summary indexes.
2+ years of database logging and monitoring concepts experience
4+ years of application production support experience
3+ years with one or more Agile tools used for tracking user
stories or backlogs, such as Confluence or Jira
Desired Qualifications:
Experienced with Site Reliability Engineering (SRE)
2+ years of experience with Application performance, monitoring and
optimization using Blazemeter, JMeter, Splunk and AppDynamics
2+ years of experience with scripting languages such as Bash,
PowerShell, Python, Shell, VBScript, or JavaScript
Experience and understanding of AIOPS and related tools such as
MoogSoft or Big Panda
Experience with one or more automation tools such as Ansible.
Experience with Container technologies: Kubernetes, Docker, PKS
Job Expectations:
Flexibility to provide 24/7 support on a rotation basis as
needed.
Ability to work additional hours outside regular business
hours.
We Value Diversity
At Wells Fargo, we believe in diversity, equity and inclusion in
the workplace; accordingly, we welcome applications for employment
from all qualified candidates, regardless of race, color, gender,
national origin, religion, age, sexual orientation, gender
identity, gender expression, genetic information, individuals with
disabilities, pregnancy, marital status, status as a protected
veteran or any other status protected by applicable law.
Employees support our focus on building strong customer
relationships balanced with a strong risk mitigating and
compliance-driven culture which firmly establishes those
disciplines as critical to the success of our customers and
company. They are accountable for execution of all applicable risk
programs (Credit, Market, Financial Crimes, Operational, Regulatory
Compliance), which includes effectively following and adhering to
applicable Wells Fargo policies and procedures, appropriately
fulfilling risk and compliance obligations, timely and effective
escalation and remediation of issues, and making sound risk
decisions. There is emphasis on proactive monitoring, governance,
risk identification and escalation, as well as making sound risk
decisions commensurate with the business unit's risk appetite and
all risk and compliance program requirements.
Candidates applying to job openings posted in US: All qualified
applicants will receive consideration for employment without regard
to race, color, religion, age, sex, sexual orientation, gender
identity, national origin, disability, or status as a protected
veteran.
Candidates applying to job openings posted in Canada: Applications
for employment are encouraged from all qualified candidates,
including women, persons with disabilities, aboriginal peoples and
visible minorities. Accommodation for applicants with disabilities
is available upon request in connection with the recruitment
process.
Drug and Alcohol Policy
Wells Fargo maintains a drug free workplace. Please see our Drug
and Alcohol Policy to learn more.
Company: WELLS FARGO BANK
Req Number: R-206458
Updated: Mon May 22 00:00:00 UTC 2023
Location: CHANDLER,Arizona
Keywords: Wells Fargo, Chandler , Senior Systems Operations Engineer, Other , Chandler, Arizona
Click
here to apply!
|