Lead - Organizational Reliability
By Canadian Natural Resources Limited (CNRL) At Calgary, Alberta, Canada
Contribute to hiring, performance reviews and performance management of your team
Demonstrated leadership experience with cross-functional teams; effective communication, organization & analytical skills
Practitioner of Project Management, Lean Six Sigma (LSS) and Four Disciplines of Execution Excellence (4DX)
Competitive salary, stock options, stock savings plan and benefits
Minimum 7 years’ experience; including 5+ years of Oil & Gas industry and 3+ years of Engineering or Technical role
Certification of Lean Six Sigma Yellow Belt is required and Green Belt or higher is considered an asset
Site Reliability Engineer / Python Developer
By The Blue Mount At Montreal, Quebec, Canada
- Manage Service reliability by managing risk
- Hands on experience using Enterprise Tools such as App Dynamic, Grafana, Splunk, Dynatrace
- Three Tier Support experience with DBs such as IBM, DB2, Sybase, Mongo, Green Plum, KDB
- Generally speaking, practical experience running large scale online systems is always an advantage.
- Knowledge of messaging layer: MQ / CPS / XML
SITE Reliability Engineer We are looking for only permanent residents
Site Reliability Engineer 2 Jobs
By Microsoft At Ottawa, Ontario, Canada
Excellent collaboration, organizational, time management skills.
Knowledge of data governance and data management practices.
3+ years people management experience.
Project Management Institute (PMI) or equivalent Project Management certification.
Improve Customer experience by analyzing signals from various sources, driving RCA's and Service improvements involving bug fixes
Identify and drive requirements for increased customer self-supportability
Site Reliability Specialist Jobs
By CDW Canada At Toronto, Ontario, Canada
5+ years’ experience in Infrastructure Management including a focus on customer service and complex technical assistance
Experience n a Managed Service Provider environment
Translate requirements/stories and mock-ups into fully functional features
Bachelor of Science in Computer Science or related technical degree or equivalent work experience
3+ years’ experience with private and public Cloud technologies
Experience with support ticketing systems and monitoring systems
Supervisor, Site Reliability Jobs
By CDW Canada At Toronto, Ontario, Canada
Experience leading a technical team from a position of direct personnel management (Lead, Supervisor of Manager)
Participate in the change management process, providing oversight on peer reviews and approvals of changes
Demonstrated skill in prioritizing multiple responsibilities, tasks and projects simultaneously including ability to shift priorities
5-7 years of systems design, technical architecture, and support experience
Bachelor of Science in Computer Science or related technical degree or equivalent work experience
Familiarity and deep experience implementing ITIL best practices
Specialist Site Reliability Engineer (Sre)- En
By CN At Calgary, Alberta, Canada
Knowledge And/or Experience In The Following Areas
Review and approve solution requirements for RAM
Determine non-functional requirements and targets for RAM performance
Assign requirements to solutions and products to ensure they support the ability to measure RAM Key Performance Indicators (KPI’s)
Minimum 5-10 years overall work experience
Bachelor’s degree in Electrical Engineering, Mechanical Engineering, Computer Science, Computer Engineering or equivalent degree & experience
Sr. Site Reliability Administrator
By OpenText At Mississauga, Ontario, Canada
Establish and maintain a good relationship with team members, Product Development, Product Management, Customer Service, Client management, and other cross-functional teams.
Experience and knowledge in RDBMS and No-Sql databases such as Oracle, Postgres, MariaDB, and Cassandra.
Uses technical knowledge, creativity, and company practices to drive down occurrences of incidents through the development of proactive monitoring and alerting.
Hands-on experience with cloud infrastructure; Google, AWS, or Azure a plus
Experience with PaaS technologies such as Cloud Foundry, Kubernetes, and Bosh.
Good understanding and operational experience with container technologies like Docker, rkt, Mesos.
Site Reliability Engineer Jobs
By Aquanow At Vancouver, British Columbia, Canada
Configuration Management experience such as Ansible, Chef, Puppet, or similar.
Have excellent problem solving, time management skills, ability to work independently or as part of a team.
Assist with documentation on knowledge base articles, operation manuals and incident and problem notes.
At least three years of experience with the administration, monitoring, and troubleshooting of cloud and on prem services.
Experience in supporting cloud applications including API connections.
Experience with cloud services such as AWS.
Site Reliability Engineer - Linux
By Astreya At Canada
Experience with configuration management and orchestration tools like Ansible and Jenkins
Responsible for securing Linux servers including identity, patch, and access management.
Bachelor’s degree (B.S/B.A) from four-college or university and 5+ years’ related experience and/or training; or equivalent combination of education and experience
Astreya offers comprehensive benefits to all Regular, Full-Time Employees, including:
Keep systems properly configured, updated, healthy, hardened and scaled to meet the project/operational requirements.
Proven experience with Linux server administration – applying patches, ensuring health, right-sizing, architecture standards, and troubleshooting problems.
Lead Data Engineer- Toronto, Canada (On-Site)
By Lorven Technologies Inc. At Toronto, Ontario, Canada
Knowledge & experience with driving IaC concepts within an organization, leveraging Terraform, Ansible, GitHub actions.
Quickly understand organizational dynamics and management priorities, and to be able to work effectively in a fast paced, results driven company.
Conceptualize, design, and implement analytics products that enhance Investments analytics capabilities.
Participate in the development life cycle from start to completion - requirements analysis, development, testing, and deployment.
Ensure architecture will support the requirements of the Investments business.
Develop tools that prepare, transform, combine, and manage structured and unstructured data for use by Investments business users.
Staff Site Reliability Engineer - Remote
By Luxury Presence At Canada
Professional experience with security practices, credential rotations, secrets management systems (ideally Vault)
Oversee the management and optimization of Kubernetes clusters to ensure smooth operations, scalability, and resource utilization
Terraform / ArgoCD / Crossplane.io for resources management
Vault project for secrets and password management
Develop and maintain IaC using tools like Terraform and Crossplane to provision and manage AWS and Kubernetes resources efficiently and consistently
Provide guidance and mentorship to junior SREs and share knowledge with the broader team
Senior Site Reliability Engineer
By Lyft At Montreal, Quebec, Canada
Share knowledge by giving brown bags, tech talks, and evangelizing appropriate tech and engineering best practices.
Share on-call responsibilities with other teammates and own the improvement of the team’s on-call practices
Experience designing, implementing and operating large-scale customer-facing SaaS infrastructure
Experience with high level programming languages (Python, Go, Java, etc.) and declarative languages (eg. Terraform).
Experience working with public cloud platforms (eg. AWS, Google Cloud Platform, Microsoft Azure, etc.) and container orchestrators (eg. Kubernetes)
Strong troubleshooting and debugging skills
Sr. Site Reliability Administrator
By OpenText At Waterloo, Ontario, Canada
Extensive knowledge in the use of GitHub, GitLab, Perforce, or similar for source management
Extensive knowledge in the use of build artifact management
Familiarity with the use of OpsGenie, Everbridge, or similar for notification management
Proven ability to influence others and strong analytical thinking skills are critical to success in this position
Demonstrated ability to conceptualize, manage, and prioritize multiple projects
2-5 years of experience building fully automated software applications and infrastructure provisioning and deployment pipelines
Site Reliability Engineer I
By Loblaw Companies Limited At Brampton, Ontario, Canada
Hands on experience in software engineering and one or multiple cloud vendors (AWS, GCP, Azure)
Identify and diagnose deficiencies related to existing frameworks, tools and processes, and recommend creative solutions to reduce waste and continuously improve.
Identifying and diagnosing deficiencies related to systems, coding and infrastructure, and recommending creative solutions for mitigation.
You are first and foremost a developer who understands ops or aspires to do ops stuff.
Passionate for troubleshooting technical problems and automating solutions to reduce manual toil
Inspired by working with both a Development and SRE mindset (i.e. software and infrastructure)
Site Reliability Engineer Jobs
By IBM At Markham, Ontario, Canada
Configuration management and infrastructure-as-code experience (Terraform and Kubernetes/OpenShift preferred)
2+ year’s Experience with provisioning and configuration management systems (terraform, ansible) across multiple cloud providers
Experience with remote bare metal hardware provisioning. PXE boot, working with remote hands
Multiple hosting models preferred (managed, colo, and AWS/multi-cloud)
Extra kudos for experience with:
Minimum of 4 to 7 years' experience in hands-on global production system deployment, administration and troubleshooting
Senior Engineer Ii - Digital Site Reliability
By lululemon At Vancouver, British Columbia, Canada
Contribute to engineering automation, management or development of pre-prod and production systems
Mentor and guide junior team members, sharing knowledge and expertise to foster a culture of learning and continuous improvement.
Eight+ years of engineering experience
Five+ years experience with CI/CD tools, GitLab preferred
Proficiency in at least one programming language (e.g., Python, Go, Java) and experience with scripting and automation.
Acknowledge the presence of choice in every moment and take personal responsibility for your life.
Senior/Principal Site Reliability Engineer (Sre)
By CorGTA Inc. At Mississauga, Ontario, Canada
· Experience with configuration management tools such as Ansible, YAML and Terraform.
· Strong verbal and written communications skills Solid knowledge of web architecture and systems.
· 3+ years of experience as an SRE supporting production infrastructure.
· 5+ years of overall software engineering experience in a development environment.
· Bachelor’s degree in computer science and/or a wide range of relevant work experience.
· Extensive experience with Azure.
Backend Site Reliability Engineer - The Sims
By Maxis Studios - EA At Toronto, Ontario, Canada
Demonstrate excellent while responding to the changing requirements of the game development process
Support a great player experience for The Sims players by participating in live service support, incident troubleshooting, and resolution
You have 2+ years of job experience in a hands-on coding, DevOps, or infrastructure configuration role
You have experience with load testing, troubleshooting, and optimizing the performance of web services
You’re willing to support a great player experience by participating in live service support and incident troubleshooting and resolution
You have experience with CI/CD pipeline technology like Gitlab CI
Contract Senior Site Reliability Engineer (Sre) With Azure - $85-90 P/H Inc.
By CorGTA Inc. At Mississauga, Ontario, Canada
Experience with configuration management tools such as Ansible, YAML and Terraform.
Strong verbal and written communications skills Solid knowledge of web architecture and systems.
3+ years of experience as an SRE supporting production infrastructure.
5+ years of overall software engineering experience in a development environment.
Bachelor’s degree in computer science and/or a wide range of relevant work experience.
Experience with container orchestration platforms such as Kubernetes.
Bhjob15656_20180 - Sre (Site Reliability Engineer) Specialist - Digital Infrastracture
By Myticas Consulting At Ontario, Canada
Manage, troubleshoot and audit Identity Access Management for employees, customers and partners with the focus on least privilege principle
5-10 years of experience as a SRE, DevOps or System Administrator
You have a good knowledge of Cybersecurity practices (e.g. phishing, thread/intrusion detection, malware protection, …)
Evaluate and Design evolutions of the infrastructure: IaC, deployments, automation, reliability
Operate company’s services in the Cloud, using Kubernetes, artifact registries, CI/CD tools, security scanning (e.g. SAST), etc.
Participate in the investigation and response to infrastructure and security incidents