News & Events


  •  NEWS   MERL's breakthrough speech separation technology featured in Mitsubishi Electric Corporation's Annual R&D Open House
    Date: May 24, 2017
    Where: Tokyo, Japan
    MERL Contacts: Bret Harsham; John Hershey; Jonathan Le Roux
    Research Areas: Multimedia, Speech & Audio
    Brief
    • Mitsubishi Electric Corporation announced that it has created the world's first technology that separates in real time the simultaneous speech of multiple unknown speakers recorded with a single microphone. It's a key step towards building machines that can interact in noisy environments, in the same way that humans can have meaningful conversations in the presence of many other conversations. In tests, the simultaneous speeches of two and three people were separated with up to 90 and 80 percent accuracy, respectively. The novel technology, which was realized with Mitsubishi Electric's proprietary "Deep Clustering" method based on artificial intelligence (AI), is expected to contribute to more intelligible voice communications and more accurate automatic speech recognition. A characteristic feature of this approach is its versatility, in the sense that voices can be separated regardless of their language or the gender of the speakers. A live speech separation demonstration that took place on May 24 in Tokyo, Japan, was widely covered by the Japanese media, with reports by three of the main Japanese TV stations and multiple articles in print and online newspapers. The technology is based on recent research by MERL's Speech and Audio team.
      Links:
      Mitsubishi Electric Corporation Press Release
      MERL Deep Clustering Demo

      Media Coverage:

      Fuji TV, News, "Minna no Mirai" (Japanese)
      The Nikkei (Japanese)
      Nikkei Technology Online (Japanese)
      Sankei Biz (Japanese)
      EE Times Japan (Japanese)
      ITpro (Japanese)
      Nikkan Sports (Japanese)
      Nikkan Kogyo Shimbun (Japanese)
      Dempa Shimbun (Japanese)
      Il Sole 24 Ore (Italian)
      IEEE Spectrum (English)
  •  EVENT   Society for Industrial and Applied Mathematics panel for students on careers in industry
    Date & Time: Monday, July 10, 2017; 6:15 PM - 7:15 PM
    Speaker: Andrew Knyazev and other panelists, MERL
    MERL Contacts: Joseph Katz; Andrew Knyazev
    Location: David Lawrence Convention Center, Pittsburgh PA
    Research Areas: Electronics & Communications, Multimedia, Data Analytics, Computer Vision, Mechatronics, Algorithms, Advanced Control Systems, Computational Geometry, Computational Photography, Computational Sensing, Decision Optimization, Digital Video, Dynamical Systems, Information Security, Machine Learning, Optical Communications & Devices, Power & RF, Predictive Modeling, Speech & Audio, Wireless Communications & Signal Processing
    Brief
    • Andrew Knyazev accepted an invitation to represent MERL at the panel on Student Careers in Business, Industry and Government at the annual meeting of the Society for Industrial and Applied Mathematics (SIAM).

      The format consists of a five-minute introduction by each of the panelists covering their background and an overview of the mathematical and computational challenges at their organization. The introductions will be followed by questions from the students.
  •  EVENT   MERL to participate in Xconomy Forum on AI & Robotics
    Date & Time: Tuesday, March 28, 2017; 1:30 - 5:30PM
    MERL Contacts: John Hershey; Joseph Katz; Daniel Nikovski; Alan Sullivan; Jay Thornton; Anthony Vetro; Richard (Dick) Waters; Jinyun Zhang
    Location: Google (355 Main St., 5th Floor, Cambridge MA)
    Research Areas: Multimedia, Data Analytics, Computer Vision, Mechatronics
    Brief
    • How will AI and robotics reshape the economy and create new opportunities (and challenges) across industries? Who are the hottest companies that will compete with the likes of Google, Amazon, and Uber to create the future? And what are New England innovators doing to strengthen the local cluster and help lead the national discussion?

      MERL will be participating in Xconomy's third annual conference on AI and robotics in Boston to address these questions. MERL President & CEO, Dick Waters, will be on a panel discussing the status and future of self-driving vehicles. Lab members will also be on hand to demonstrate and discuss recent advances in AI and robotics technology.

      The agenda and registration for the event can be found online: https://xconomyforum85.eventbrite.com
  •  TALK   Generative Model-Based Text-to-Speech Synthesis
    Date & Time: Wednesday, February 1, 2017; 12:00-13:00
    Speaker: Dr. Heiga ZEN, Google
    MERL Host: Chiori Hori
    Research Areas: Multimedia, Speech & Audio
    Brief
    • Recent progress in generative modeling has improved the naturalness of synthesized speech significantly. In this talk I will summarize these generative model-based approaches for speech synthesis such as WaveNet, a deep generative model of raw audio waveforms. We show that WaveNets are able to generate speech which mimics any human voice and which sounds more natural than the best existing Text-to-Speech systems.
      See https://deepmind.com/blog/wavenet-generative-model-raw-audio/ for further details.
  •  NEWS   MERL to present 10 papers at ICASSP 2017
    Date: March 5, 2017 - March 9, 2017
    Where: New Orleans
    MERL Contacts: Petros Boufounos; Chen Feng; John Hershey; Takaaki Hori; Jonathan Le Roux; Dehong Liu; Hassan Mansour; Dong Tian; Anthony Vetro; Ye Wang
    Research Areas: Multimedia, Computer Vision, Computational Geometry, Computational Sensing, Digital Video, Information Security, Speech & Audio
    Brief
    • MERL researchers will present 10 papers at the upcoming IEEE International Conference on Acoustics, Speech & Signal Processing (ICASSP), to be held in New Orleans from March 5-9, 2017. Topics to be presented include recent advances in speech recognition and audio processing; graph signal processing; computational imaging; and privacy-preserving data analysis.

      ICASSP is the flagship conference of the IEEE Signal Processing Society, and the world's largest and most comprehensive technical conference focused on the research advances and latest technological development in signal and information processing. The event attracts more than 2000 participants each year.
  •  AWARD   APSIPA recognizes Anthony Vetro as a 2016 Industrial Distinguished Leader
    Date: October 15, 2016
    Awarded to: Anthony Vetro
    MERL Contact: Anthony Vetro
    Research Area: Multimedia
    Brief
    • Anthony Vetro was recognized by APSIPA (Asia-Pacific Signal and Information Processing Association) as a 2016 Industrial Distinguished Leader. This distinction is reserved for selected APSIPA members with extraordinary accomplishments in any of the fields related to APSIPA scope. A list of past recipients can be found online: http://www.apsipa.org/industrial.htm.
  •  TALK   High-Dimensional Analysis of Stochastic Optimization Algorithms for Estimation and Learning
    Date & Time: Tuesday, December 13, 2016; Noon
    Speaker: Yue M. Lu, John A. Paulson School of Engineering and Applied Sciences, Harvard University
    MERL Host: Petros Boufounos
    Research Areas: Multimedia, Computational Sensing, Machine Learning
    Brief
    • In this talk, we will present a framework for analyzing, in the high-dimensional limit, the exact dynamics of several stochastic optimization algorithms that arise in signal and information processing. For concreteness, we consider two prototypical problems: sparse principal component analysis and regularized linear regression (e.g. LASSO). For each case, we show that the time-varying estimates given by the algorithms will converge weakly to a deterministic "limiting process" in the high-dimensional limit. Moreover, this limiting process can be characterized as the unique solution of a nonlinear PDE, and it provides exact information regarding the asymptotic performance of the algorithms. For example, performance metrics such as the MSE, the cosine similarity and the misclassification rate in sparse support recovery can all be obtained by examining the deterministic limiting process. A steady-state analysis of the nonlinear PDE also reveals interesting phase transition phenomena related to the performance of the algorithms. Although our analysis is asymptotic in nature, numerical simulations show that the theoretical predictions are accurate for moderate signal dimensions.
  •  TALK   Collaborative dictionary learning from big, distributed data
    Date & Time: Friday, December 2, 2016; 11:00 AM
    Speaker: Prof. Waheed Bajwa, Rutgers University
    MERL Host: Petros Boufounos
    Research Areas: Multimedia, Computational Sensing
    Brief
    • While distributed information processing has a rich history, relatively less attention has been paid to the problem of collaborative learning of nonlinear geometric structures underlying data distributed across sites that are connected to each other in an arbitrary topology. In this talk, we discuss this problem in the context of collaborative dictionary learning from big, distributed data. It is assumed that a number of geographically-distributed, interconnected sites have massive local data and they are interested in collaboratively learning a low-dimensional geometric structure underlying these data. In contrast to some of the previous works on subspace-based data representations, we focus on the geometric structure of a union of subspaces (UoS). In this regard, we propose a distributed algorithm, termed cloud K-SVD, for collaborative learning of a UoS structure underlying distributed data of interest. The goal of cloud K-SVD is to learn an overcomplete dictionary at each individual site such that every sample in the distributed data can be represented through a small number of atoms of the learned dictionary. Cloud K-SVD accomplishes this goal without requiring communication of individual data samples between different sites. In this talk, we also theoretically characterize deviations of the dictionaries learned at individual sites by cloud K-SVD from a centralized solution. Finally, we numerically illustrate the efficacy of cloud K-SVD in the context of supervised training of nonlinear classifiers from distributed, labeled training data.
  •  EVENT   MERL organizes Workshop on End-to-End Speech and Audio Processing at NIPS 2016
    Date: Saturday, December 10, 2016
    MERL Contact: John Hershey
    Location: Centre Convencions Internacional Barcelona, Barcelona, Spain
    Research Areas: Multimedia, Machine Learning, Speech & Audio
    Brief
    • MERL researcher John Hershey is organizing a Workshop on End-to-End Speech and Audio Processing, on behalf of MERL's Speech and Audio team, and in collaboration with Philemon Brakel of the University of Montreal. The workshop focuses on recent advances in end-to-end deep learning methods to address alignment and structured prediction problems that naturally arise in speech and audio processing. The all-day workshop takes place on Saturday, December 10th at NIPS 2016, in Barcelona, Spain.
  •  EVENT   John Hershey to present tutorial at the 2016 IEEE SLT Workshop
    Date: Tuesday, December 13, 2016
    Speaker: John Hershey, MERL
    MERL Contacts: John Hershey; Jonathan Le Roux
    Location: 2016 IEEE Spoken Language Technology Workshop, San Diego, California
    Research Areas: Multimedia, Machine Learning, Speech & Audio
    Brief
    • MERL researcher John Hershey presents an invited tutorial at the 2016 IEEE Workshop on Spoken Language Technology, in San Diego, California. The topic, "developing novel deep neural network architectures from probabilistic models," stems from MERL work with collaborators Jonathan Le Roux and Shinji Watanabe on a principled framework that seeks to improve our understanding of deep neural networks, and draws inspiration for new types of deep networks from the arsenal of principles and tools developed over the years for conventional probabilistic models. The tutorial covers a range of parallel ideas in the literature that have formed a recent trend, as well as their application to speech and language.
  •  EVENT   2016 IEEE Workshop on Spoken Language Technology: Sponsored by MERL
    Date: Tuesday, December 13, 2016 - Friday, December 16, 2016
    MERL Contact: John Hershey
    Location: San Diego, California
    Research Areas: Multimedia, Speech & Audio
    Brief
    • The IEEE Workshop on Spoken Language Technology is a premier international showcase for advances in spoken language technology. The theme for 2016 is "machine learning: from signal to concepts," which reflects the current excitement about end-to-end learning in speech and language processing. This year, MERL is showing its support for SLT as one of its top sponsors, along with Amazon and Microsoft.
  •  EVENT   MERL Open House
    Date & Time: Thursday, December 8, 2016; 4:00-7:00pm
    MERL Contacts: Elizabeth Phillips; Anthony Vetro
    Location: 201 Broadway, 8th Floor, Cambridge, MA
    Research Areas: Electronics & Communications, Multimedia, Data Analytics, Computer Vision, Mechatronics, Algorithms, Business Innovation
    Brief
    • Snacks, demos, science: On Thursday 12/8, Mitsubishi Electric Research Labs (MERL) will host an open house for graduate+ students interested in internships, post-docs, and research scientist positions. The event will be held from 4-7pm and will feature demos & short presentations in our main areas of research: algorithms, multimedia, electronics, communications, computer vision, speech processing, optimization, machine learning, data analytics, mechatronics, dynamics, control, and robotics. MERL is a high-impact, publication-oriented research lab with very extensive internship and university collaboration programs. Most internships lead to publication; many of our interns and staff have gone on to notable careers at MERL and in academia. Come mix with our researchers, see our state-of-the-art technologies, and learn about our research opportunities. Dress code: casual, with resumes.

      Pre-registration for the event is strongly encouraged:
      https://www.eventbrite.com/e/merl-open-house-tickets-29408503626

      Current internship and employment openings:
      http://www.merl.com/internship/openings
      http://www.merl.com/employment/employment
  •  EVENT   MERL participating in Engineering Career Fair
    Date & Time: Wednesday, November 16, 2016; 3:30-6:30pm
    MERL Contacts: Elizabeth Phillips; Anthony Vetro
    Location: Sheraton Commander (16 Garden Street, Cambridge, MA)
    Research Areas: Electronics & Communications, Multimedia, Data Analytics, Computer Vision, Mechatronics, Algorithms, Business Innovation
    Brief
    • MERL will be participating in the Engineering Career Fair Collaborative, which is being held on November 16, 2016 at the Sheraton Commander in Cambridge from 3:30-6:30pm. Graduate students with an interest in learning about internship and other employment opportunities at MERL are invited to visit our booth. Staff members will be on hand to discuss current openings. We will also be showing some demonstrations of current research projects.

      Current internship and employment openings:
      http://www.merl.com/internship/openings
      http://www.merl.com/employment/employment
  •  EVENT   SANE 2016 - Speech and Audio in the Northeast
    Date: Friday, October 21, 2016
    MERL Contacts: John Hershey; Jonathan Le Roux
    Location: MIT, McGovern Institute for Brain Research, Cambridge, MA
    Research Areas: Multimedia, Speech & Audio
    Brief
    • SANE 2016, a one-day event gathering researchers and students in speech and audio from the Northeast of the American continent, will be held on Friday October 21, 2016 at MIT's Brain and Cognitive Sciences Department, at the McGovern Institute for Brain Research, in Cambridge, MA.

      It is a follow-up to SANE 2012 (Mitsubishi Electric Research Labs - MERL), SANE 2013 (Columbia University), SANE 2014 (MIT CSAIL), and SANE 2015 (Google NY). Since the first edition, the audience has steadily grown, gathering 140 researchers and students in 2015.

      SANE 2016 will feature invited talks by leading researchers: Juan P. Bello (NYU), William T. Freeman (MIT/Google), Nima Mesgarani (Columbia University), Dan Ellis (Google), Shinji Watanabe (MERL), Josh McDermott (MIT), and Jesse Engel (Google). It will also feature a lively poster session during lunch time, open to both students and researchers.

      SANE 2016 is organized by Jonathan Le Roux (MERL), Josh McDermott (MIT), Jim Glass (MIT), and John R. Hershey (MERL).
  •  NEWS   MERL Speech & Audio researchers present two sold-out tutorials at Interspeech 2016
    Date: September 8, 2016
    Where: Interspeech 2016, San Francisco, CA
    MERL Contact: Jonathan Le Roux
    Research Areas: Multimedia, Speech & Audio
    Brief
    • MERL Speech and Audio Team researchers Shinji Watanabe and Jonathan Le Roux presented two tutorials on September 8 at the Interspeech 2016 conference, held in San Francisco, CA. Shinji collaborated with Marc Delcroix (NTT Communication Science Laboratories, Japan) to deliver a three-hour lecture on "Recent Advances in Distant Speech Recognition", drawing upon their experience organizing and participating in six different recent robust speech processing challenges. Jonathan teamed with Emmanuel Vincent (Inria, France) and Hakan Erdogan (Sabanci University, Microsoft Research) to give an in-depth tour of the latest advances in "Learning-based Approaches to Speech Enhancement And Separation". This collaboration stemmed from extensive stays at MERL by Emmanuel and Hakan, Emmanuel as a summer visitor, and Hakan as a MERL visiting research scientist for over a year while on sabbatical.

      Both tutorials were sold out, each attracting more than 100 researchers and students in related fields, and received high praise from audience members.
  •  EVENT   MERL Hosts 2nd Annual Women In Science Celebration
    Date & Time: Friday, July 22, 2016; 12:00 Noon
    MERL Contacts: Elizabeth Phillips; Jinyun Zhang
    Location: Cambridge Brewery
    Research Areas: Electronics & Communications, Multimedia, Data Analytics, Computer Vision, Mechatronics, Algorithms
    Brief
    • MERL hosted its 2nd Annual "Women In Science Celebration". MERL's current team of female interns discussed and celebrated the contributions they've made during their internships at MERL.
  •  EVENT   MERL celebrates 25 years of innovation
    Date: Thursday, June 2, 2016
    MERL Contacts: Elizabeth Phillips; Anthony Vetro
    Location: Norton's Woods Conference Center at American Academy of Arts & Sciences, Cambridge, MA
    Research Areas: Electronics & Communications, Multimedia, Data Analytics, Computer Vision, Mechatronics, Algorithms, Business Innovation
    Brief
    • MERL celebrated 25 years of innovation on Thursday, June 2 at the Norton's Woods Conference Center at the American Academy of Arts & Sciences in Cambridge, MA. The event was a great success, with inspiring keynote talks, insightful panel sessions, and an exciting research showcase of MERL's latest breakthroughs.

      Please visit the event page to view photos of each session, video presentations, as well as a commemorative booklet that highlights past and current research.
  •  NEWS   MERL makes a strong showing at the American Control Conference
    Date: July 6, 2016 - July 8, 2016
    Where: American Control Conference (ACC)
    MERL Contacts: Mouhacine Benosman; Scott Bortoff; Petros Boufounos; Daniel Burns; Claus Danielson; Stefano Di Cairano; Amir-massoud Farahmand; Abraham Goldsmith; Piyush Grover; Uros Kalabic; Andrew Knyazev; Christopher Laughman; Daniel Nikovski; Arvind Raghunathan; Yebin Wang; Avishai Weiss
    Research Areas: Multimedia, Data Analytics, Mechatronics, Business Innovation, Advanced Control Systems, Dynamical Systems, Machine Learning, Predictive Modeling
    Brief
    • The premier American Control Conference (ACC) takes place in Boston July 6-8. This year MERL researchers will present a record 20 papers(!) at ACC, with several contributions, especially in autonomous vehicle path planning and in Model Predictive Control (MPC) theory and applications, including manufacturing machines, electric motors, satellite station keeping, and HVAC. Other important themes developed in MERL's presentations concern adaptation, learning, and optimization in control systems.
  •  TALK   Speech structure and its application to speech processing -- Relational, holistic and abstract representation of speech
    Date & Time: Friday, June 3, 2016; 1:30PM - 3:00PM
    Speaker: Nobuaki Minematsu and Daisuke Saito, The University of Tokyo
    Research Areas: Multimedia, Speech & Audio
    Brief
    • Speech signals convey various kinds of information, which are grouped into two kinds, linguistic and extra-linguistic information. Many speech applications, however, focus on only a single aspect of speech. For example, speech recognizers try to extract only word identity from speech and speaker recognizers extract only speaker identity. Here, irrelevant features are often treated as hidden or latent by applying probability theory to a large number of samples, or the irrelevant features are normalized to have quasi-standard values. In speech analysis, however, phases are usually removed, not hidden or normalized, and pitch harmonics are also removed, not hidden or normalized. The resulting speech spectrum still contains both linguistic information and extra-linguistic information. Is there any good method to remove extra-linguistic information from the spectrum? In this talk, our answer to that question is introduced, called speech structure. Extra-linguistic variation can be modeled as feature space transformation and our speech structure is based on the transform-invariance of f-divergence. This proposal was inspired by findings in classical studies of structural phonology and recent studies of developmental psychology. Speech structure has been applied to accent clustering, speech recognition, and language identification. These applications are also explained in the talk.
  •  EVENT   John Hershey Invited to Speak at Deep Learning Summit 2016 in Boston
    Date: Thursday, May 12, 2016 - Friday, May 13, 2016
    MERL Contact: John Hershey
    Location: Deep Learning Summit, Boston, MA
    Research Areas: Multimedia, Speech & Audio
    Brief
    • MERL Speech and Audio Senior Team Leader John Hershey is among a set of high-profile researchers invited to speak at the Deep Learning Summit 2016 in Boston on May 12-13, 2016. John will present the team's groundbreaking work on general sound separation using a novel deep learning framework called Deep Clustering. For the first time, an artificial intelligence is able to crack the half-century-old "cocktail party problem", that is, to isolate the speech of a single person from a mixture of multiple unknown speakers, as humans do when having a conversation in a loud crowd.