-
ST0238: Internship - Multi-Modal Sensing and Understanding
The Computational Sensing team at MERL is seeking a highly motivated intern to conduct fundamental research on multi-modal sensing and understanding —algorithms that can understand, explain, and act on multi-sensor data (e.g., RF, infrared, LiDAR, event camera). Ideal candidates will be comfortable bridging state-of-the-art perception (detection/segmentation/tracking) with higher-level semantic understanding and reasoning capabilities. Experience with text, visual, and multimodal reasoning is a plus. The intern will work closely with MERL researchers to develop novel algorithms, design experiments using MERL’s in-house testbeds, and prepare results for patents and publication. The internship is expected to last 3 months, with a flexible start date.
Required Specific Experience
- Expertise in physical sensing across RF (radar, UWB, Wi-Fi), infrared, LiDAR, and event-camera modalities. Experienced with radar systems and concepts including FMCW and MIMO configurations, Doppler signature interpretation, radar point cloud and heatmap representations, and raw ADC waveforms;
- Solid understanding of state-of-the-art transformer-based (e.g., DETR) and diffusion-based (e.g., DiffusionDet) frameworks;
- Demonstrated work in text-, visual-, and multimodal semantic understanding and reasoning.
- Hands-on experience with open large-scale multi-sensor datasets (e.g., nuScenes, Waymo Open Dataset, Argoverse) and open radar datasets (e.g., MMVR, HIBER, RT-Pose, K-Radar).
- Proficiency in Python and deep learning frameworks (PyTorch/JAX), plus experience with GPU cluster job scheduling and scalable data pipelines.
- Proven publication record in top-tier venues such as CVPR, ICCV, ECCV, NeurIPS, ICLR, ICML (or equivalent).
The pay range for this internship position will be 6-8K per month.
- Research Areas: Artificial Intelligence, Computational Sensing, Machine Learning, Signal Processing
- Host: Perry Wang
- Apply Now
-
CI0213: Internship - Efficient Foundation Models for Edge Intelligence
Efficient Foundation Models for Edge Intelligence
We are seeking passionate and skilled interns to join our cutting-edge research team at Mitsubishi Electric Research Laboratories (MERL), focusing on efficient and sustainable AI. This internship offers a unique opportunity to contribute to next-generation machine learning techniques that enable real-time, edge, and energy-efficient AI systems — with the ultimate goal of publishing at top-tier AI venues.
Research Focus Areas
- Edge AI, real-time AI, and compact neural architectures
- Energy-efficient and hardware-friendly AI
- On-device, on-premise, and embedded-system AI
- Generative and multi-modal foundation models with resource constraints
Qualifications
- Advanced research experience in generative models, efficient architectures, or foundation models (LLM, VLM, LMM, FoMo)
- Strong understanding of state-of-the-art machine learning and optimization techniques
- Proficiency in Python and PyTorch, with familiarity in other deep learning frameworks
- Proven research record and motivation for publication in leading AI conferences
Internship Details
- Duration: Approximately 3 months
- Start Date: Flexible
- Objective: Conduct high-quality research leading to publications in premier AI conferences
If you are a highly motivated researcher eager to push the boundaries of efficient and sustainable AI, we encourage you to apply. Join us in shaping the future of intelligent systems that are not only powerful but also responsible and sustainable.
The pay range for this internship position will be 6-8K per month.
- Research Areas: Artificial Intelligence, Optimization, Signal Processing, Machine Learning, Computer Vision
- Host: Toshi Koike-Akino
- Apply Now
-
OR0299: Internship - Human-Robot Interaction
MERL is seeking a highly motivated and qualified intern to conduct research on human-robot interaction. There are several research topics of interest, including foundation models, shared autonomy, object handovers, learning from feedback, and safety in close proximity. Applicants must be Ph.D. students with strong backgrounds in robot learning or computer vision. The selected intern will collaborate closely with MERL researchers to design and implement novel algorithms, conduct experiments, and disseminate research findings through a top-tier conference. The start date and duration are flexible, and interested applicants are encouraged to apply with an updated CV and a list of relevant publications.
Required Specific Experience
- Demonstrated experience in human-robot interaction, robot learning, or vision-language models
- Experience with ROS2, Python, and deep learning frameworks such as PyTorch
- Current enrollment in a Ph.D. program
- A strong publication record or demonstrated research potential
The pay range for this internship position will be 6-8K per month.
- Research Areas: Robotics, Artificial Intelligence, Computer Vision
- Host: Siddarth Jain
- Apply Now
-
OR0298: Internship - Robotic Disassembly
MERL is seeking a highly motivated and qualified intern to conduct research on robotic disassembly. There are several research topics of interest, including task and sequence planning, learning skills, perception under occlusion, contact-rich manipulation, and vision-language models for acting under uncertainty. Applicants must be Ph.D. students with strong backgrounds in robot learning or computer vision. The selected intern will collaborate closely with MERL researchers to design and implement novel algorithms, conduct experiments, and disseminate research findings through a top-tier conference. The start date and duration are flexible, and interested applicants are encouraged to apply with an updated CV and a list of relevant publications.
Required Specific Experience
- Demonstrated experience in computer vision, robot learning, or vision-language models
- Experience with ROS2, Python, and deep learning frameworks such as PyTorch
- Current enrollment in a Ph.D. program
- A strong publication record or demonstrated research potential
The pay range for this internship position will be 6-8K per month.
- Research Areas: Robotics, Computer Vision, Artificial Intelligence
- Host: Siddarth Jain
- Apply Now
-
SA0191: Internship - Human-Robot Interaction Based on Multimodal Scene Understanding
We are looking for a graduate student interested in advancing the field of multimodal scene understanding, focusing on scene understanding using natural language for robot dialog and/or indoor monitoring with a large language model. The intern will collaborate with MERL researchers to derive and implement new models and optimization methods, conduct experiments, and prepare results for publication. Internships regularly lead to one or more publications in top-tier venues, which can later become part of the intern's doctoral work. The ideal candidates are senior Ph.D. students with experience in deep learning for audio-visual, signal, and natural language processing. Good programming skills in Python and knowledge of deep learning frameworks such as PyTorch are essential. Multiple positions are available with a flexible start date (not just Spring/Summer but throughout 2026) and duration (typically 3-6 months).
Required Specific Experience
- Experience with ROS2, C/C++, Python, and deep learning frameworks such as PyTorch are essential.
The pay range for this internship position will be 6-8K per month.
- Research Areas: Artificial Intelligence, Machine Learning, Robotics, Speech & Audio
- Host: Chiori Hori
- Apply Now
-
CV0075: Internship - Multimodal Embodied AI
MERL is looking for a self-motivated intern to work on problems at the intersection of multimodal large language models and embodied AI in dynamic indoor environments. The ideal candidate would be a PhD student with a strong background in machine learning and computer vision, as demonstrated by top-tier publications. The candidate must have prior experience in designing synthetic scenes (e.g., 3D games) using popular graphics software, embodied AI, large language models, reinforcement learning, and the use of simulators such as Habitat/SoundSpaces. Hands on experience in using animated 3D human shape models (e.g., SMPL and variants) is desired. The intern is expected to collaborate with researchers in computer vision at MERL to develop algorithms and prepare manuscripts for scientific publications.
Required Specific Experience
- Experience in designing 3D interactive scenes
- Experience with vision based embodied AI using simulators (implementation on real robotic hardware would be a plus).
- Experience training large language models on multimodal data
- Experience with training reinforcement learning algorithms
- Strong foundations in machine learning and programming
- Strong track record of publications in top-tier computer vision and machine learning venues (such as CVPR, NeurIPS, etc.).
- Research Areas: Artificial Intelligence, Computer Vision, Speech & Audio, Robotics, Machine Learning
- Host: Anoop Cherian
- Apply Now
-
CV0224: Internship - Language-Guided Human-Robot Interaction
MERL is looking for a self-motivated intern to research on the topic of language-guided dynamic human-robot interaction in simulations. The intern must have a strong background in state-of-the-art machine learning research including the knowledge of agentic AI technologies, toolboxes to train/fine-tune large vision-and-language models, as well as expertise working on simulation platforms such as AI Habitat or similar. The intern is expected to collaborate with researchers in the computer vision team at MERL to develop algorithms and prepare manuscripts for scientific publications.
Required Specific Experience
- Experience in realistic simulators, including AI Habitat, TDW, etc.
- Experience in modeling agentic pipelines for solving complex tasks, including assimilating multimodal data, natural language interaction, and physical reasoning.
- Strong computer vision and machine learning foundations, including reinforcement learning, training large vision-and-language models, etc.
- Strong track record of publications in top-tier computer vision and machine learning venues (such as CVPR, NeurIPS, etc.)
- Must be enrolled in a graduate program, ideally towards a Ph.D.
The pay range for this internship position will be 6-8K per month.
- Research Areas: Artificial Intelligence, Computer Vision, Machine Learning
- Host: Anoop Cherian
- Apply Now
-
CV0101: Internship - Multimodal Algorithmic Reasoning
MERL is looking for a self-motivated intern to research on problems at the intersection of multimodal large language models and neural algorithmic reasoning. An ideal intern would be a Ph.D. student with a strong background in machine learning and computer vision. The candidate must have prior experience with training multimodal LLMs for solving vision-and-language tasks. Experience in participating and winning mathematical Olympiads is desired. Publications in theoretical machine learning venues would be a strong plus. The intern is expected to collaborate with researchers in the computer vision team at MERL to develop algorithms and prepare manuscripts for scientific publications.
Required Specific Experience
- Experience with training large vision-and-language models
- Experience with solving mathematical reasoning problems
- Experience with programming in Python using PyTorch
- Enrolled in a PhD program
- Strong track record of publications in top-tier computer vision and machine learning venues (such as CVPR, NeurIPS, etc.).
- Research Areas: Artificial Intelligence, Computer Vision, Machine Learning
- Host: Anoop Cherian
- Apply Now
-
CV0225: Internship - Reconstruction/Novel View Synthesis of Dynamic Scenes
MERL is looking for a highly motivated intern to work on an original research project in reconstruction/rendering dynamic 3D scenes. A strong background in 3D computer vision and/or computer graphics is required. Experience in the latest advances of deep learning in this area, such as neural radiance fields (NeRFs)/Gaussian Splatting (GS)/Point Map reconstruction methods, is an added plus and will be valued. The successful candidate is expected to have published at least one paper in a top-tier computer vision/graphics or machine learning venue, such as CVPR, ECCV, ICCV, SIGGRAPH, 3DV, ICML, ICLR, NeurIPS or AAAI, and possess solid programming skills in Python and popular deep learning frameworks like Pytorch. The goal would be for such a candidate to collaborate with MERL researchers to develop algorithms and prepare manuscripts for scientific publications. The position is available for graduate students on a Ph.D. track or those that have recently graduated with a Ph.D. Duration and start dates are flexible but are expected to last for at least 3 months. This internship is preferred to be onsite at MERL’s office in Cambridge, MA.
Required Specific Experience
- Prior publications in top computer vision/graphics and/or machine learning venues, such as CVPR, ECCV, ICCV, SIGGRAPH, 3DV, ICML, ICLR, NeurIPS or AAAI.
- Experience in the latest novel-view synthesis approaches such as Neural Radiance Fields (NeRFs) or Gaussian Splatting (GS) and/or in the latest 3D point map reconstruction methods.
- Proficiency in coding (particularly scripting languages like Python) and familiarity with deep learning frameworks, such as PyTorch or Tensorflow.
The pay range for this internship position will be $6-8K per month.
- Research Areas: Computer Vision, Artificial Intelligence, Machine Learning
- Host: Moitreya Chatterjee
- Apply Now
-
EA0234: Internship - Multi-modal sensor fusion for predictive maintenance
Mitsubishi Electric Research Laboratories (MERL) is seeking a self-motivated Ph.D. candidate in Computer Science, Electrical Engineering, or a related field for a 3-month internship focused on developing advanced machine learning algorithms to fuse multi-modal time sequence data for electric machine condition monitoring and predictive maintenance. The ideal candidate will have a strong background in machine learning and signal processing with a proven publication record. Experience in time-sequence analysis, multimodal sensor fusion, or physics-informed machine learning is preferred. Knowledge of electric machines is a plus. The intern will collaborate with MERL researchers to design and develop novel algorithms, prepare technical reports, and contribute to manuscripts for top-tier scientific publications. This position requires onsite work at MERL, with a flexible start date.
Required Specific Experience
- Experience with multi-modal sensor fusion.
The pay range for this internship position will be 6-8K per month.
- Research Areas: Artificial Intelligence, Electric Systems, Signal Processing, Machine Learning
- Host: Dehong Liu
- Apply Now
-
CA0220: Internship - Visual Simultaneous Localization and Mapping (V-SLAM)
MERL seeks a self-motivated graduate student to conduct research on Visual Simultaneous Localization and Mapping (V-SLAM). Depending on the candidate’s expertise and interests, the internship may focus on topics such as — but not limited to — camera pose estimation, feature detection and matching, visual-LiDAR data fusion, pose-graph optimization, loop closure detection, and image-based camera relocalization.
The ideal candidate is a PhD student with a strong foundation in 3D computer vision and proficient programming skills in C/C++ and/or Python. Applicants should have at least one publication in a premier computer vision, machine learning, or robotics conference, such as CVPR, ECCV, ICCV, NeurIPS, ICRA, or IROS.
The intern will collaborate with MERL researchers to develop and implement novel algorithms for V-SLAM, perform experiments, and document research outcomes. The work is expected to lead to a submission to a top-tier conference. The start date and internship duration are flexible.
Required Specific Experience
- Experience with 3D Computer Vision and Simultaneous Localization & Mapping (SLAM).
The pay range for this internship position will be 6-8K per month.
- Research Areas: Artificial Intelligence, Computer Vision, Robotics
- Host: Pedro Miraldo
- Apply Now
-
CA0221: Internship - Robust Estimation for Computer Vision
MERL seeks a motivated graduate student to conduct research in robust estimation for computer vision. Depending on the candidate’s background and interests, the internship may involve topics such as — but not limited to — camera pose estimation, 3D registration, camera calibration, pose-graph optimization, or transformation averaging.
The ideal applicant is a PhD student with strong expertise in 3D computer vision, RANSAC, or graduated non-convexity algorithms, along with solid programming skills in C/C++ and/or Python. Candidates should have at least one publication in a leading computer vision, machine learning, or robotics venue (e.g., CVPR, ECCV, ICCV, NeurIPS, ICRA, or IROS).
The intern will work closely with MERL researchers to develop and implement new algorithms for visual SLAM (V-SLAM), perform experiments, and document results. The goal is to produce work suitable for submission to a top-tier conference. The start date and duration of the internship are flexible.
Required Specific Experience
- Demonstrated experience in 3D computer vision, RANSAC, or graduated non-convexity algorithms for vision applications.
The pay range for this internship position will be 6-8K per month.
- Research Areas: Artificial Intelligence, Computer Vision, Robotics, Optimization
- Host: Pedro Miraldo
- Apply Now