Sesame

Bringing the computer to life

About the Company

Sesame envisions a future where computers are truly lifelike—capable of seeing, hearing, and collaborating in natural, human-like ways. With a focus on building next-generation voice companions, the company is rethinking how people interact with technology. The founding team includes leaders from Oculus and Ubiquity6, alongside experienced innovators from Meta, Google, and Apple, combining deep expertise across both hardware and software.

About the Role

This role focuses on advancing the intersection of 3D computer vision and machine learning to power Sesame’s wearable devices. By integrating visual understanding with conversational AI, the position aims to connect speech with the physical world. The engineer will address challenges in areas such as gaze tracking, SLAM, and embedding physical constraints into data-driven models, collaborating across research, hardware, and product teams to turn cutting-edge techniques into real-world applications.

Responsibilities

  • Develop machine learning models across a variety of 3D computer vision tasks.
  • Contribute across the full ML pipeline: model architecture, data capture, curation, training, evaluation, and inference infrastructure.
  • Collaborate with firmware and hardware teams to deploy vision models onto embedded systems.
  • Identify promising approaches from academic research and develop novel techniques to meet product goals.

Required Skills

  • Proven ability to work independently in ambiguous and fast-changing environments.
  • Strong background in developing computer vision and machine learning models.
  • Familiarity with current state-of-the-art methods in computer vision.
  • Proficiency with deep learning frameworks such as PyTorch or JAX.
  • Experience handling large-scale datasets, including multi-camera data.
  • Excellent communication and cross-functional collaboration skills.
  • Bachelor’s degree or higher in Computer Science, Computer Vision, Applied Mathematics, Machine Learning, or a related field.

Preferred Qualifications

  • Master’s or Ph.D. in a relevant field.
  • Experience deploying models in real-world products.
  • Background in startup environments.
  • Expertise incorporating geometric, physical, or structural priors into ML models.

Compensation and Benefits

  • Salary range: $190,000 – $320,000 annually (depending on experience and location).
  • 401(k) matching program.
  • 100% employer-paid health, vision, and dental insurance.
  • Unlimited paid time off and sick leave.
  • Flexible spending account (FSA) matching.
  • Benefits apply to full-time employees (not contingent/contract workers).
