Meta logo

Meta

Company

4 hours ago

Research Scientist Intern, Large Foundation Models and Generative AI (PhD)

Redmond, WA
Full-time

Job Description

The Meta Reality Labs Research Team is composed of world-class researchers, developers, and engineers dedicated to shaping the future of AR/VR and machine perception. The Surreal Vision group at RL Research is seeking exceptional Research Scientist interns to contribute to the development of an egocentric AI system. This system will form the foundation for contextual-AI-enabled AR devices and humanoid robots. As a research intern, you will tackle cutting-edge research challenges, innovating novel computer vision and machine learning techniques. Your research project may cover or relate to the following topics: - Egocentric vision language model for long-context 3D scene understanding - Utilizing memory for more consistent and accurate future state prediction using visual language action models or world models - Exploring novel learning strategies to improve the quality and generalization of visual language action models or world models with egocentric data - 4D generation & reconstruction of dynamic scenes Our internships are twelve (12) to twenty four (24) weeks long and we have various start dates throughout the year. Some projects may require a minimum of 24 consecutive weeks.
  • Plan and execute cutting-edge research and development to advance the state-of-the-art in machine perception, future prediction, 4D scene understanding & reconstruction, robotics.
  • Collaborate with other researchers and engineers across machine perception teams at Meta to develop experiments, prototypes, and concepts that advance the state-of-the-art in AR/VR and AI systems.
  • Work with the team to help design, setup, and run practical experiments and prototype systems related to large-scale high quality sensing and machine reasoning.
Minimum Qualifications
  • Currently has, or is in the process of obtaining a PhD degree in the domain of computer-vision, machine learning, robotics, and computer graphics
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
  • Knowledge and hands-on experience on 3D computer vision
  • Hands-on experience implementing large foundation models and generative models, such as LLMs, VLMs, video diffusion models, LRMs, World Models, VLAs, Reinforcement Learning
  • Experience working within Python environments such as pytorch
  • Experience working in a Unix environment
Preferred Qualifications
  • Ability to work a consecutive 24 weeks
  • Proven track record of achieving significant results as demonstrated by grants, fellowships, patents, as well as first-authored publications at leading workshops or conferences such as CVPR/ECCV/ICCV, ICLR, NeurIPS, CoRL/RSS/ICRA/IROS, SIGGRAPH/SIGGRAPH Asia, etc
  • Strong track-record of published research in the fields of generative modeling, large foundation models, robotics, neural reconstruction, and neural rendering
  • Strong programming experience using python and pytorch
  • Demonstrated software engineer experience via an internship, work experience, coding competitions, or widely used contributions in open source repositories (e.g. GitHub)
  • Intent to return to a degree-program after the completion of the internship
  • Experience working and communicating cross functionally in a team environment
For those who live in or expect to work from California if hired for this position, please click here for additional information.
About Meta
Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today—beyond the constraints of screens, the limits of distance, and even the rules of physics.

$7,650/month to $12,134/month + benefits

Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.


Equal Employment Opportunity
Meta is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. You may view our Equal Employment Opportunity notice here.
Meta is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, fill out the Accommodations request form.

Please mention that you found this job on MoAIJobs, this helps us grow. Thank you!