Software Engineer - GenAI inference
Listing sourced from greenhouse on 5/3/2026. CVCraft does not host this job; clicking Apply redirects to the source.
Job Description
<p>P-1284</p> <h3><strong>About This Role</strong></h3> <p>As a software engineer for GenAI inference, you will help design, develop, and optimize the inference engine that powers Databricks’ Foundation Model API. You’ll work at the intersection of research and production, ensuring our large language model (LLM) serving systems are fast, scalable, and efficient. Your work will touch the full GenAI inference stack — from kernels and runtimes to orchestration and memory management.</p> <h3><strong>What You Will Do</strong></h3> <ul> <li>Contribute to the design and implementation of the inference engine, and collaborate on model-serving stack optimized for large-scale LLMs inference</li> <li>Collaborate with researchers to bring new model architectures or features (sparsity, activation compression, mixture-of-experts) into the engine</li> <li>Optimize for latency, throughput, memor
Salary Context for Software Engineer
The US national median for Software Engineer is $132,270; in San Francisco the median is $205,019 (+55% vs national).
Tailor your resume to this role before applying
75% of applicants get filtered by ATS before a human reads them. Run a free 60-second scan to see what keywords are missing from your resume.
Free ATS ScanSimilar Jobs
Data Center Security Specialist, DC Security - NTE
Data Centre Logistics Manager, MEL Cluster
Intern
Senior Specialist - Software Engineering
Senior Engineer - Backend Cryptography team- Hashicorp Vault
Specialist - Software Engineering
Pass the ATS Filter Before Applying
Most resumes never reach a human. CVCraft scans your resume against this exact role's keywords in 60 seconds — free.
Scan My Resume Free