Site Reliability Engineer Lead Jobs in Canada, Employment

By Canadian Natural Resources Limited (CNRL) At Calgary, Alberta, Canada

Contribute to hiring, performance reviews and performance management of your team

Demonstrated leadership experience with cross-functional teams; effective communication, organization & analytical skills

Practitioner of Project Management, Lean Six Sigma (LSS) and Four Disciplines of Execution Excellence (4DX)

Competitive salary, stock options, stock savings plan and benefits

Minimum 7 years’ experience, including 5+ years in the Oil & Gas industry and 3+ years in an Engineering or Technical role

Certification of Lean Six Sigma Yellow Belt is required and Green Belt or higher is considered an asset

Site Reliability Engineer / Python Developer

By The Blue Mount At Montreal, Quebec, Canada

- Manage Service reliability by managing risk

- Hands-on experience using Enterprise Tools such as AppDynamics, Grafana, Splunk, Dynatrace

- Three Tier Support experience with DBs such as IBM DB2, Sybase, MongoDB, Greenplum, KDB

- Practical experience running large-scale online systems is an advantage.

- Knowledge of messaging layer: MQ / CPS / XML

Site Reliability Engineer: we are looking for permanent residents only

Site Reliability Engineer 2 Jobs

By Microsoft At Ottawa, Ontario, Canada

Excellent collaboration, organizational, and time management skills.

Knowledge of data governance and data management practices.

3+ years people management experience.

Project Management Institute (PMI) or equivalent Project Management certification.

Improve customer experience by analyzing signals from various sources, driving RCAs and service improvements involving bug fixes

Identify and drive requirements for increased customer self-supportability

Site Reliability Specialist Jobs

By CDW Canada At Toronto, Ontario, Canada

5+ years’ experience in Infrastructure Management including a focus on customer service and complex technical assistance

Experience in a Managed Service Provider environment

Translate requirements/stories and mock-ups into fully functional features

Bachelor of Science in Computer Science or related technical degree or equivalent work experience

3+ years’ experience with private and public Cloud technologies

Experience with support ticketing systems and monitoring systems

Supervisor, Site Reliability Jobs

By CDW Canada At Toronto, Ontario, Canada

Experience leading a technical team from a position of direct personnel management (Lead, Supervisor or Manager)

Participate in the change management process, providing oversight on peer reviews and approvals of changes

Demonstrated skill in prioritizing multiple responsibilities, tasks and projects simultaneously including ability to shift priorities

5-7 years of systems design, technical architecture, and support experience

Bachelor of Science in Computer Science or related technical degree or equivalent work experience

Familiarity and deep experience implementing ITIL best practices

Specialist Site Reliability Engineer (SRE) - EN

By CN At Calgary, Alberta, Canada

Knowledge and/or experience in the following areas:

Review and approve solution requirements for RAM

Determine non-functional requirements and targets for RAM performance

Assign requirements to solutions and products to ensure they support the ability to measure RAM Key Performance Indicators (KPIs)

Minimum 5-10 years overall work experience

Bachelor’s degree in Electrical Engineering, Mechanical Engineering, Computer Science, Computer Engineering or equivalent degree & experience

Sr. Site Reliability Administrator

By OpenText At Mississauga, Ontario, Canada

Establish and maintain a good relationship with team members, Product Development, Product Management, Customer Service, Client management, and other cross-functional teams.

Experience and knowledge in RDBMS and NoSQL databases such as Oracle, Postgres, MariaDB, and Cassandra.

Uses technical knowledge, creativity, and company practices to drive down occurrences of incidents through the development of proactive monitoring and alerting.

Hands-on experience with cloud infrastructure; Google, AWS, or Azure a plus

Experience with PaaS technologies such as Cloud Foundry, Kubernetes, and BOSH.

Good understanding and operational experience with container technologies like Docker, rkt, Mesos.

Site Reliability Engineer Jobs

By Aquanow At Vancouver, British Columbia, Canada

Configuration Management experience such as Ansible, Chef, Puppet, or similar.

Excellent problem-solving and time management skills, and the ability to work independently or as part of a team.

Assist with documentation on knowledge base articles, operation manuals and incident and problem notes.

At least three years of experience with the administration, monitoring, and troubleshooting of cloud and on-prem services.

Experience in supporting cloud applications including API connections.

Experience with cloud services such as AWS.

Site Reliability Engineer - Linux

By Astreya At Canada

Experience with configuration management and orchestration tools like Ansible and Jenkins

Responsible for securing Linux servers including identity, patch, and access management.

Bachelor’s degree (B.S./B.A.) from a four-year college or university and 5+ years’ related experience and/or training; or equivalent combination of education and experience

Astreya offers comprehensive benefits to all Regular, Full-Time Employees, including:

Keep systems properly configured, updated, healthy, hardened and scaled to meet the project/operational requirements.

Proven experience with Linux server administration – applying patches, ensuring health, right-sizing, architecture standards, and troubleshooting problems.

Lead Data Engineer - Toronto, Canada (On-Site)

By Lorven Technologies Inc. At Toronto, Ontario, Canada

Knowledge & experience with driving IaC concepts within an organization, leveraging Terraform, Ansible, GitHub Actions.

Quickly understand organizational dynamics and management priorities, and work effectively in a fast-paced, results-driven company.

Conceptualize, design, and implement analytics products that enhance Investments analytics capabilities.

Participate in the development life cycle from start to completion - requirements analysis, development, testing, and deployment.

Ensure architecture will support the requirements of the Investments business.

Develop tools that prepare, transform, combine, and manage structured and unstructured data for use by Investments business users.

Staff Site Reliability Engineer - Remote

By Luxury Presence At Canada

Professional experience with security practices, credential rotations, secrets management systems (ideally Vault)

Oversee the management and optimization of Kubernetes clusters to ensure smooth operations, scalability, and resource utilization

Terraform / ArgoCD / Crossplane.io for resource management

Vault project for secrets and password management

Develop and maintain IaC using tools like Terraform and Crossplane to provision and manage AWS and Kubernetes resources efficiently and consistently

Provide guidance and mentorship to junior SREs and share knowledge with the broader team

Senior Site Reliability Engineer

By Lyft At Montreal, Quebec, Canada

Share knowledge by giving brown bags, tech talks, and evangelizing appropriate tech and engineering best practices.

Share on-call responsibilities with other teammates and own the improvement of the team’s on-call practices

Experience designing, implementing and operating large-scale customer-facing SaaS infrastructure

Experience with high-level programming languages (Python, Go, Java, etc.) and declarative languages (e.g., Terraform).

Experience working with public cloud platforms (e.g., AWS, Google Cloud Platform, Microsoft Azure) and container orchestrators (e.g., Kubernetes)

Strong troubleshooting and debugging skills

Sr. Site Reliability Administrator

By OpenText At Waterloo, Ontario, Canada

Extensive knowledge in the use of GitHub, GitLab, Perforce, or similar for source management

Extensive knowledge in the use of build artifact management

Familiarity with the use of OpsGenie, Everbridge, or similar for notification management

Proven ability to influence others and strong analytical thinking skills are critical to success in this position

Demonstrated ability to conceptualize, manage, and prioritize multiple projects

2-5 years of experience building fully automated software applications and infrastructure provisioning and deployment pipelines

Site Reliability Engineer I

By Loblaw Companies Limited At Brampton, Ontario, Canada

Hands-on experience in software engineering and with one or more cloud vendors (AWS, GCP, Azure)

Identify and diagnose deficiencies related to existing frameworks, tools and processes, and recommend creative solutions to reduce waste and continuously improve.

Identifying and diagnosing deficiencies related to systems, coding and infrastructure, and recommending creative solutions for mitigation.

You are first and foremost a developer who understands ops or aspires to do ops stuff.

Passionate about troubleshooting technical problems and automating solutions to reduce manual toil

Inspired by working with both a Development and SRE mindset (i.e. software and infrastructure)

Site Reliability Engineer Jobs

By IBM At Markham, Ontario, Canada

Configuration management and infrastructure-as-code experience (Terraform and Kubernetes/OpenShift preferred)

2+ years’ experience with provisioning and configuration management systems (Terraform, Ansible) across multiple cloud providers

Experience with remote bare-metal hardware provisioning: PXE boot, working with remote hands

Multiple hosting models preferred (managed, colo, and AWS/multi-cloud)

Extra kudos for experience with:

Minimum of 4 to 7 years' experience in hands-on global production system deployment, administration and troubleshooting

Senior Engineer II - Digital Site Reliability

By lululemon At Vancouver, British Columbia, Canada

Contribute to engineering automation, management or development of pre-prod and production systems

Mentor and guide junior team members, sharing knowledge and expertise to foster a culture of learning and continuous improvement.

Eight+ years of engineering experience

Five+ years’ experience with CI/CD tools, GitLab preferred

Proficiency in at least one programming language (e.g., Python, Go, Java) and experience with scripting and automation.

Acknowledge the presence of choice in every moment and take personal responsibility for your life.

Senior/Principal Site Reliability Engineer (SRE)

By CorGTA Inc. At Mississauga, Ontario, Canada

· Experience with configuration management tools such as Ansible, YAML and Terraform.

· Strong verbal and written communication skills; solid knowledge of web architecture and systems.

· 3+ years of experience as an SRE supporting production infrastructure.

· 5+ years of overall software engineering experience in a development environment.

· Bachelor’s degree in computer science and/or a wide range of relevant work experience.

· Extensive experience with Azure.

Backend Site Reliability Engineer - The Sims

By Maxis Studios - EA At Toronto, Ontario, Canada

Demonstrate excellence while responding to the changing requirements of the game development process

Support a great player experience for The Sims players by participating in live service support, incident troubleshooting, and resolution

You have 2+ years of job experience in a hands-on coding, DevOps, or infrastructure configuration role

You have experience with load testing, troubleshooting, and optimizing the performance of web services

You’re willing to support a great player experience by participating in live service support and incident troubleshooting and resolution

You have experience with CI/CD pipeline technology like GitLab CI

Contract Senior Site Reliability Engineer (SRE) with Azure - $85-90 p/h Inc.

By CorGTA Inc. At Mississauga, Ontario, Canada

Experience with configuration management tools such as Ansible, YAML and Terraform.

Strong verbal and written communication skills; solid knowledge of web architecture and systems.

3+ years of experience as an SRE supporting production infrastructure.

5+ years of overall software engineering experience in a development environment.

Bachelor’s degree in computer science and/or a wide range of relevant work experience.

Experience with container orchestration platforms such as Kubernetes.

Bhjob15656_20180 - SRE (Site Reliability Engineer) Specialist - Digital Infrastructure

By Myticas Consulting At Ontario, Canada

Manage, troubleshoot and audit Identity Access Management for employees, customers and partners with the focus on least privilege principle

5-10 years of experience as an SRE, DevOps or System Administrator

You have a good knowledge of Cybersecurity practices (e.g. phishing, threat/intrusion detection, malware protection, …)

Evaluate and design evolutions of the infrastructure: IaC, deployments, automation, reliability

Operate company’s services in the Cloud, using Kubernetes, artifact registries, CI/CD tools, security scanning (e.g. SAST), etc.

Participate in the investigation and response to infrastructure and security incidents
