Distributed Systems Engineer Job at Magic, Remote

  • Magic
  • Remote

Job Description

Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal.

About the role:

As a distributed systems engineer, you will build the data and coordination systems that enable ultra-long context inference and training on Magic’s GPU clusters. 

What you might work on: 

  • High-performance storage and caching systems to support long-context inference and training

  • Hacking on the internals of deep learning frameworks in the distributed setting

  • Automating fault detection and recovery systems to enable highly available training

  • Troubleshooting complex issues across GPUs, network, storage, OS, and cloud environments

What we’re looking for: 

  • Deep knowledge of distributed systems design and public cloud platforms

  • Experience designing and operating highly available, high-throughput data systems

  • Experience with the internals of distributed DBMS, batch and stream processing systems, and/or distributed file systems

  • Exceptional problem-solving skills up and down the stack

Magic strives to be the place where high-potential individuals can do their best work. We value quick learning and grit just as much as skill and experience.

Our culture:

  • Integrity. Words and actions should be aligned

  • Hands-on. At Magic, everyone is building 

  • Teamwork. We move as one team, not N individuals

  • Focus. Safely deploy AGI. Everything else is noise

  • Quality. Magic should feel like magic

Compensation, benefits and perks (US):

  • Annual salary range: $100K - $550K

  • Equity is a significant part of total compensation, in addition to salary

  • 401(k) plan with 6% salary matching

  • Generous health, dental and vision insurance for you and your dependents

  • Unlimited paid time off

  • Visa sponsorship and relocation stipend to bring you to SF, if possible

  • A small, fast-paced, highly focused team

Job Tags

Remote job, Relocation
