Lead Inference Performance Engineer Jobs in Toronto, Canada

Professional job seekers finding Canada jobs through Expertini
750,000+ professionals on Expertini 750,000+ Candidates
Join our global community
Expertini Penguin Mascot Resume Score™
Resume Score™ Instantly
Upload Your CV
Quick 30-second process

Apply Today & Jumpstart Your Career on Expertini, Trusted Since 2008.

Reset

Create Job Alert

 
   
Reset

AI Inference Performance Benchmark Engineer

A leading AI technology firm in Toronto is seeking passionate engineers to join its Inference Core Platform group. The role involves developing foundational software and hardware infrastructure for high-speed AI inference. Ideal candidates will have a Bachelor’s or Master’s degree in Computer Engineering, profic ...

AI Inference Performance Benchmark Engineer

A leading AI technology firm in Toronto is seeking passionate engineers to join its Inference Core Platform group. The role involves developing foundational software and hardware infrastructure for high-speed AI inference. Ideal candidates will have a Bachelor’s or Master’s degree in Computer Engineering, profic ...

Performance Engineer for AI Inference

Elevate AI inference systems as a Performance Engineer, specializing in model evaluations and optimization on wafer-scale technology. Engage with the latest innovations to implement enhancements for greater efficiency.This position focuses on bringing state-of-the-art AI models to production through ri ...

Performance Engineer for AI Inference

Elevate AI inference systems as a Performance Engineer, specializing in model evaluations and optimization on wafer-scale technology. Engage with the latest innovations to implement enhancements for greater efficiency.This position focuses on bringing state-of-the-art AI models to production through rigorous ...

Performance Engineer for AI Inference

Elevate AI inference systems as a Performance Engineer, specializing in model evaluations and optimization on wafer-scale technology. Engage with the latest innovations to implement enhancements for greater efficiency. This position focuses on bringing state-of-the-art AI models to production throug ...

Inference Performance Engineer, ML Systems & Optimization

A leading AI technology company in Toronto is seeking an experienced software engineer to join their inference model team. This role involves prototyping AI architectural tweaks, developing benchmarking automation, and collaborating closely with silicon teams. Candidates should have over 3 years of experience in ...

Inference Performance Engineer, ML Systems & Optimization

A leading AI technology company in Toronto is seeking an experienced software engineer to join their inference model team. This role involves prototyping AI architectural tweaks, developing benchmarking automation, and collaborating closely with silicon teams. Candidates should have over 3 years of experience in ...

Inference Performance Engineer, ML Systems & Optimization

A leading AI technology company in Toronto is seeking an experienced software engineer to join their inference model team. This role involves prototyping AI architectural tweaks, developing benchmarking automation, and collaborating closely with silicon teams. Candidates should have over 3 years of experience in ...

Inference Performance Engineer, ML Systems & Optimization

A leading AI technology company in Toronto is seeking an experienced software engineer to join their inference model team. This role involves prototyping AI architectural tweaks, developing benchmarking automation, and collaborating closely with silicon teams. Candidates should have over 3 years of experience in ...

Performance Engineer for AI Benchmarking and Inference Insights

Enhance AI capabilities as a Performance Engineer focused on Benchmarking! Collaborate with experts to measure, analyze, and optimize performance in pioneering AI systems deployed at scale.In this exciting role within the Inference Core Platform, you will be instrumental in defining and driving the per ...

Performance Engineer for AI Benchmarking and Inference Insights

Enhance AI capabilities as a Performance Engineer focused on Benchmarking! Collaborate with experts to measure, analyze, and optimize performance in pioneering AI systems deployed at scale.In this exciting role within the Inference Core Platform, you will be instrumental in defining and driving the performan ...

Performance Engineer for AI Benchmarking and Inference Insights

Enhance AI capabilities as a Performance Engineer focused on Benchmarking! Collaborate with experts to measure, analyze, and optimize performance in pioneering AI systems deployed at scale.In this exciting role within the Inference Core Platform, you will be instrumental in defining and driving the performan ...

Staff Frontend Engineer: Inference Platform Lead

A leading AI tech firm in Canada is seeking a full-stack Technical Lead to own critical areas of the Developer Console. This deeply technical role requires building high-quality frontend systems and designing backend services that scale effectively. You will drive technical direction and mentor engineers, ensuri ...

Lead Generative AI Inference Systems Engineer

A leading AI hardware company in Toronto seeks a Senior Software Engineer for its Inference ML Engineering team. This role involves designing APIs and tools for state-of-the-art generative AI models on custom hardware, optimizing performance, and leading cross-functional initiatives. Ideal candidates will have 8 ...

Lead Generative AI Inference Systems Engineer

A leading AI hardware company in Toronto seeks a Senior Software Engineer for its Inference ML Engineering team. This role involves designing APIs and tools for state-of-the-art generative AI models on custom hardware, optimizing performance, and leading cross-functional initiatives. Ideal candidates will have 8 ...

Lead Generative AI Inference Systems Engineer

A leading AI hardware company in Toronto seeks a Senior Software Engineer for its Inference ML Engineering team. This role involves designing APIs and tools for state-of-the-art generative AI models on custom hardware, optimizing performance, and leading cross-functional initiatives. Ideal candidates will have 8 ...

Full Stack Technical Lead for High Performance Inference Systems

Lead the transformation of AI workloads as a Full-Stack Technical Lead focused on performance-driven inference systems. Develop high-quality applications that ensure real-time efficiency and user engagement.This role focuses on producing cutting-edge technology by bridging frontend and backend efforts. ...

Full Stack Technical Lead for High Performance Inference Systems

Lead the transformation of AI workloads as a Full-Stack Technical Lead focused on performance-driven inference systems. Develop high-quality applications that ensure real-time efficiency and user engagement.This role focuses on producing cutting-edge technology by bridging frontend and backend efforts. Your ...

Full Stack Technical Lead for High Performance Inference Systems

Lead the transformation of AI workloads as a Full-Stack Technical Lead focused on performance-driven inference systems. Develop high-quality applications that ensure real-time efficiency and user engagement. This role focuses on producing cutting-edge technology by bridging frontend and backend effo ...

Staff FE Engineer Inference

About The Role We’re hiring a staff level full-stack Technical Lead (L6/L7) to own and scale critical parts of the Cerebras Developer Console — the primary interface developers and enterprises use to run and manage inference workloads. This is a deeply technical, end-to-end role. You’ll b ...

Staff FE Engineer Inference

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inf ...

Deployment Engineer, AI Inference

About Cerebras Systems Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of one device. This enables industry‑leading training and inference speeds an ...

Staff FE Engineer Inference

About The Role We’re hiring a staff level full-stack Technical Lead (L6/L7) to own and scale critical parts of the Cerebras Developer Console — the primary interface developers and enterprises use to run and manage inference workloads.This is a deeply technical, end-to-end role. You’ll build high-quali ...

Software Engineer – Inference Serving

Join to apply for the Software Engineer – Inference Serving role at Taalas At Taalas we believe that fundamental progress is achieved by those who are willing to understand and assail a problem end-to-end, without regard for commonly accepted abstractions and boundaries. We are buil ...

Staff FE Engineer Inference

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference ...

Software Engineer – Inference Serving

Join to apply for theSoftware Engineer – Inference Servingrole atTaalasAt Taalas we believe that fundamental progress is achieved by those who are willing to understand and assail a problem end-to-end, without regard for commonly accepted abstractions and boundaries. We ar ...

Staff FE Engineer Inference

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry‑leading training and inference ...

Senior Inference Engineer AI

# **Our Privacy Statement & Cookie Policy**New Position: This position is open due to an existing vacancy to support our evolving business needs.Thomson Reuters is seeking a Senior Inference Engineer, AI. This person will collaborate with platform teams to enhance capacity forecasting for AI workloads and work w ...

Deployment Engineer, AI Inference

About Cerebras Systems Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of one device. This enables industry‑leading training and inf ...

Staff FE Engineer Inference

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inf ...

Senior Inference Engineer AI

# **Our Privacy Statement & Cookie Policy**New Position: This position is open due to an existing vacancy to support our evolving business needs.Thomson Reuters is seeking a Senior Inference Engineer, AI. This person will collaborate with platform teams to enhance capacity forecasting for AI workloads and work w ...

Software Engineer – Inference Serving

Join to apply for the Software Engineer – Inference Serving role at Taalas At Taalas we believe that fundamental progress is achieved by those who are willing to understand and assail a problem end-to-end, without regard for commonly accepted abstractions and boundaries. We are buil ...

Senior Inference Engineer AI

# **Our Privacy Statement & Cookie Policy**New Position: This position is open due to an existing vacancy to support our evolving business needs.Thomson Reuters is seeking a Senior Inference Engineer, AI. This person will collaborate with platform teams to enhance capacity forecasting for AI workloads and work w ...

Staff FE Engineer Inference

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inf ...

Staff FE Engineer Inference

About The Role We’re hiring a staff level full-stack Technical Lead (L6/L7) to own and scale critical parts of the Cerebras Developer Console — the primary interface developers and enterprises use to run and manage inference workloads. This is a deeply technical, end-to-end role. ...

Deployment Engineer, AI Inference

About Cerebras Systems Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of one device. This enables industry‑leading training and ...

Software Engineer – Inference Serving

Join to apply for the Software Engineer – Inference Serving role at Taalas At Taalas we believe that fundamental progress is achieved by those who are willing to understand and assail a problem end-to-end, without regard for commonly accepted abstractions and boundaries. We are ...

Senior Software Engineer, AI Inference

Help us push the boundaries of AI inference at NVIDIA — where your systems expertise shapes both the technology and the teams building on top of it! We're looking for a Senior Software Engineer to work at the frontier of large-scale LLM serving, partnering directly with some of the world's most te ...

Staff Frontend Engineer – Inference Platform

A leading AI technology firm based in Toronto is seeking a full-stack Technical Lead to own and scale critical parts of its Developer Console. You will build high-quality systems and make strong architectural decisions that directly impact customer experience. The ideal candidate will have 8+ years of experience ...

Senior Software Engineer, AI Inference

Senior Software Engineer, AI Inference page is loaded## Senior Software Engineer, AI Inferencelocations:Canada, Torontotime type:Full timeposted on:Posted 2 Days Agojob requisition id:JR Help us push the boundaries of AI inference at NVIDIA — where your systems expertise s ...

AI Inference Digital Design Engineer

A tech company is seeking a passionate Digital Design Engineer in Toronto to work on complex AI Inference challenges. Candidates should have a degree in Electrical or Computer Engineering and proficiency in Verilog/VHDL. Responsibilities include writing RTL specifications, collaborating on layouts, and providing ...

Senior Software Engineer, AI Inference

Senior Software Engineer, AI Inference page is loaded## Senior Software Engineer, AI Inferencelocations: Canada, Torontotime type: Full timeposted on: Posted 2 Days Agojob requisition id: JR Help us push the boundaries of AI inference at NVIDIA — where your systems expertise shapes both the technology an ...

Staff Inference ML Runtime Engineer

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry‑leading training and inference ...

Staff Frontend Engineer – Inference Platform

A leading AI technology firm based in Toronto is seeking a full-stack Technical Lead to own and scale critical parts of its Developer Console. You will build high-quality systems and make strong architectural decisions that directly impact customer experience. The ideal candidate will have 8+ years of experience ...

AI Inference Digital Design Engineer

A tech company is seeking a passionate Digital Design Engineer in Toronto to work on complex AI Inference challenges. Candidates should have a degree in Electrical or Computer Engineering and proficiency in Verilog/VHDL. Responsibilities include writing RTL specifications, collaborating on layouts, and providing ...

Senior ML Inference Platform Engineer

A pioneering AI technology company in Canada is seeking a Senior Software Engineer to lead the integration of machine learning frameworks. This role involves designing APIs for user-defined ML models, collaborating closely with cross-functional teams, and optimizing performance for high-throughput execution. Can ...

Sr. Inference ML Runtime Engineer

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry‑leading training and inf ...

Sr. Inference ML Runtime Engineer

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry‑leading training and inf ...

Senior Software Engineer, AI Inference

Senior Software Engineer, AI Inference page is loaded## Senior Software Engineer, AI Inferencelocations: Canada, Torontotime type: Full timeposted on: Posted 2 Days Agojob requisition id: JR Help us push the boundaries of AI inference at NVIDIA — where your systems expertise shapes both the technology an ...

Sr. Inference ML Runtime Engineer

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry‑leading training and inf ...

🚀 Boost Your Hiring Chances with Our AI-Powered Tool-Kit

Stand out from thousands of applicants. Use our proven career tools to optimize your applications and land your dream job faster.

To-Do Planner

Organize your job search and personal tasks. All data is confidential.

Open Planner

Wellbeing Center

Access your confidential wellness report and resources to manage job search stress.

Check Wellbeing

Skill Coach

Plan your skill development with O*NET support to stay competitive in your field.

Start Coaching

Outfit Helper

Get AI-powered suggestions on what to wear for your next interview.

Find Outfit

Income Tax Calculator

Plan your finances with our calculator, updated for 2025 tax regulations.

Calculate Tax

Salary Benchmark

Get accurate, AI-supported salary trends to know your worth and negotiate better.

Check Salaries

Interview Practice

Practice for any interview with AI-enabled Q&A sessions. All data is private.

Start Practicing

Interview Predictor

Use our AI-supported tool to predict potential interview questions based on your resume.

Predict Questions

Interview Practice Timer

Use our mock interview trainer to perfect your answers under timed conditions.

Start Timer

Behavioral Mastery

Ace tricky behavioral interviews with our AI-powered practice module.

Master Answers

Question Journal

Confidentially record interview questions you were asked for future reference.

Open Journal

Interview Ace

A comprehensive tool to help you master every aspect of your interviews.

Become an Ace

Q&A Logs

Confidentially track your answers to common questions and refine them over time.

View Logs

Application Planner

Schedule and organize your job applications in one confidential planner.

Open Planner

Cover Letter Tool

Create perfect, tailored cover letters for each application with AI support.

Generate Letter

Resume Score

Get instant feedback on your resume with our NLP-supported analysis tool.

Check My Score

ATS Score

Check your resume's compatibility with Applicant Tracking Systems (ATS).

Check ATS Score

Application Analyzer

Use AI to analyze job descriptions and optimize your application materials.

Analyze Application

Career Visualizer

Confidentially plan and visualize your long-term career path and goals.

Visualize My Career

Offer Genius

Get intelligent insights and strategies to confidently negotiate job offers.

Negotiate Offers

JobFlow

Track your entire job search progress from application to offer with this intelligent tool.

Track My Flow

JobSense

Our intelligent matching engine that provides smart job recommendations.

Get Smart Matches

Networking Toolkit

Tools to build and manage your professional connections. All data is confidential.

Build Network

Professional CV

A classic, O*NET supported template for corporate and professional roles.

Use This Template

Executive CV

A premium, O*NET supported template designed for senior and C-level positions.

Use This Template

Modern CV

A fresh, stylish, O*NET supported template perfect for tech and modern industries.

Use This Template

Creative CV

A visually distinct, O*NET supported template for design and artistic roles.

Use This Template

Minimalist CV

A clean, simple, O*NET supported template that focuses purely on content.

Use This Template

Europass CV

The standard European Union recommended format for wide compatibility.

Use This Template

Student CV

An institution-recommended template perfect for internships and first jobs.

Use This Template

Graduate CV

An institution-recommended template for recent graduates entering the workforce.

Use This Template

Academic CV

The researcher-recommended format for roles in academia and research.

Use This Template

Developer/IT CV

A tech-savvy recommended template to highlight your technical skills.

Use This Template

Skilled Worker CV

A trades-recommended template to showcase hands-on skills and experience.

Use This Template

Monochrome CV

A sleek, black-and-white, O*NET supported template for a professional look.

Use This Template

Art CV

An artist-recommended template that allows your creativity to shine.

Use This Template

Harvard CV

A researcher-recommended template based on the classic Harvard format.

Use This Template

Volunteer Research

Help us improve our platform by joining our community research program.

Join Research

Review Us

Share your experience with our tools to help other job seekers.

Share Experience

Register

Create your free account to save jobs, build your profile, and track applications.

Create Account

Login

Access your dashboard, manage applications, and continue your job search.

Access Your Account

Profile Builder

Create a comprehensive professional profile that attracts recruiters and showcases your skills.

Build Your Profile

View Profile

See your public profile exactly as employers will see it. Make sure it's perfect.

Preview Profile

Bookmarked Jobs

Keep track of all your saved job opportunities in one organized place.

View Saved Jobs

Your Reviews

View and manage all the company reviews you've submitted.

See Your Reviews

Following

Manage the list of companies you follow to stay updated on their new openings.

Manage Following

Find Companies

Discover and research top employers in your country and industry.

Discover Employers

Standalone CV Builder

Use our O*NET supported CV builder to create a professional resume from scratch.

Build Your CV

PDF to DOC (Beta)

Convert your PDF resumes or documents into editable Word (DOC) format.

Convert PDF

DOC to PDF (Beta)

Create universally compatible PDF documents from your Word (DOC) files.

Create PDF

General FAQ

Find answers to common questions about our job site and platform.

Read FAQ

Job Seekers FAQ

Get help and find answers to questions specifically for job seekers.

Get Help

Job Matching

Learn about the technology and algorithms behind how we match you to jobs.

Learn How

Personalized Matching

Discover how we use your profile and activity to provide customized job suggestions.

Learn More

Quick Apply

Understand our fast application process and how to make the most of it.

Learn More

Alert Frequency

Learn how to manage your job alert settings so you get the updates you want.

Manage Settings

Job Alerts Guide

A complete guide to understanding how job alerts work and how to use them effectively.

Read Guide

Resume Matching

Learn how our system matches your resume to job requirements.

Learn More

Ethical Branding

Read our guide to building a professional and ethical personal brand.

Read Guide

Candidate Visibility

Learn how to increase your visibility to recruiters on our platform.

Increase Visibility

Verified Badge

Find out how you can get a verified badge to build trust with employers.

Get Verified

AI ATS Technology

Learn about the advanced AI and ATS technology that powers our platform.

Learn More

ATS Ranking

Understand how Applicant Tracking Systems rank you as an applicant.

Learn More

Semantic Matching

Learn how our AI-powered semantic matching goes beyond keywords.

Learn More

    Lead Inference Performance Engineer Jobs in Toronto Job Search Guide, Trends and Insights