News & Events



  •  EVENT   ISCAS 2013 - IEEE International Symposium on Circuits & Systems
    Date: Sunday, May 19, 2013 - Thursday, May 23, 2013
    MERL Contact: Anthony Vetro
    Location: Beijing, China
    Research Area: Multimedia
    Brief
    • Anthony Vetro is the Demo Co-chair of ISCAS 2013, the IEEE International Symposium on Circuits & Systems, to be held in Beijing, China, in May 2013.
  •  
  •  NEWS   Conference on Telecommunications (Conftele) 2013: publication by Anthony Vetro, Dong Tian and others
    Date: May 8, 2013
    Where: Conference on Telecommunications (Conftele)
    MERL Contacts: Dong Tian; Anthony Vetro
    Research Areas: Multimedia, Digital Video
    Brief
    • The paper "Analysis of Depth Map Resampling Filters for Depth-based 3D Video Coding" by Graziosi, D.B., Rodrigues, N.M.M., de Faria, S.M.M., Tian, D. and Vetro, A. was presented at the Conference on Telecommunications (Conftele)
  •  
  •  TALK   Practical kernel methods for automatic speech recognition
    Date & Time: Tuesday, May 7, 2013; 2:30 PM
    Speaker: Dr. Yotaro Kubo, NTT Communication Science Laboratories, Kyoto, Japan
    Research Areas: Multimedia, Speech & Audio
    Brief
    • Kernel methods are important for achieving both convexity in estimation and the ability to represent nonlinear classification. However, kernel methods have not been widely used in the field of automatic speech recognition. In this presentation, I will introduce several attempts to practically incorporate kernel methods into acoustic models for automatic speech recognition. The presentation will consist of two parts. The first part will describe maximum entropy discrimination and its application to kernel machine training. The second part will describe dimensionality reduction of kernel-based features.
  •  
  •  TALK   Visual Signal Analysis and Compression: Focus on Texture Similarity
    Date & Time: Friday, May 3, 2013; 12:00 PM
    Speaker: Prof. Thrasyvoulos N. Pappas, Northwestern University
    MERL Host: Anthony Vetro
    Research Area: Multimedia
    Brief
    • Texture is an important visual attribute both for human perception and image analysis systems. We present new structural texture similarity metrics and applications that critically depend on such metrics, with emphasis on image compression and content-based retrieval. The new metrics account for human visual perception and the stochastic nature of textures. They rely entirely on local image statistics and allow substantial point-by-point deviations between textures that according to human judgment are similar or essentially identical.

      We also present new testing procedures for objective texture similarity metrics. We identify three operating domains for evaluating the performance of such similarity metrics: the top of the similarity scale, where a monotonic relationship between metric values and subjective scores is desired; the ability to distinguish between perceptually similar and dissimilar textures; and the ability to retrieve "identical" textures. Each domain has different performance goals and requires different testing procedures. Experimental results demonstrate both the performance of the proposed metrics and the effectiveness of the proposed subjective testing procedures.
  •  
  •  NEWS   ICLR 2013: publication by Jonathan Le Roux and others
    Date: May 2, 2013
    Where: International Conference on Learning Representations (ICLR)
    MERL Contact: Jonathan Le Roux
    Research Areas: Multimedia, Speech & Audio
    Brief
    • The paper "Block Coordinate Descent for Sparse NMF" by Potluru, V.K., Plis, S.M., Le Roux, J., Pearlmutter, B.A., Calhoun, V.D. and Hayes, T.P. was presented at the International Conference on Learning Representations (ICLR)
  •  
  •  NEWS   Emerging Technologies for 3D Video: Creation, Coding, Transmission and Rendering: publication by Anthony Vetro and others
    Date: May 1, 2013
    Where: Emerging Technologies for 3D Video: Creation, Coding, Transmission and Rendering
    MERL Contact: Anthony Vetro
    Research Areas: Multimedia, Digital Video
    Brief
    • The article "Depth Based 3D Video Formats and Coding Technology" by Vetro, A. and Muller, K. was published in the book Emerging Technologies for 3D Video: Creation, Coding, Transmission and Rendering
  •  
  •  TALK   Signal Processing on Graphs: Theory and Applications
    Date & Time: Thursday, March 21, 2013; 12:00 PM
    Speaker: Prof. Antonio Ortega, University of Southern California
    MERL Host: Anthony Vetro
    Research Area: Multimedia
    Brief
    • Graphs have long been used in a wide variety of problems, such as analysis of social networks, machine learning, network protocol optimization, decoding of LDPCs or image processing. Techniques based on spectral graph theory provide a "frequency" interpretation of graph data and have proven to be quite popular in multiple applications.

      In the last few years, a growing amount of work has started extending and complementing spectral graph techniques, leading to the emergence of "Graph Signal Processing" as a broad research field. A common characteristic of this recent work is that it considers the data attached to the vertices as a "graph-signal" and seeks to create new techniques (filtering, sampling, interpolation), similar to those commonly used in conventional signal processing (for audio, images or video), so that they can be applied to these graph signals.

      In this talk, we first introduce some of the basic tools needed in developing new graph signal processing operations. We then introduce our design of wavelet filterbanks on graphs, which for the first time provides multi-resolution, critically sampled, frequency- and graph-localized transforms for graph signals. We conclude by providing several examples of how these new transforms and tools can be applied to existing problems. Time permitting, we will discuss applications to image processing, depth video compression, recommendation system design and network optimization.
  •  
  •  TALK   Communication/computation tradeoffs and other practical considerations in distributed convex optimization
    Date & Time: Thursday, March 21, 2013; 12:00 PM
    Speaker: Konstantinos Tsianos, McGill, Montreal, Canada
    MERL Host: Petros Boufounos
    Research Area: Multimedia
    Brief
    • Distributed algorithms become necessary to employ the computational resources needed for solving the large-scale optimization problems that arise in areas such as machine learning, computational biology and others. We study a very general distributed setting where the data is distributed over many machines that can communicate with one another over a network that does not have any specialized communication infrastructure. In this setting the role of the network becomes critical in the performance of a distributed algorithm. From a more theoretical standpoint we discuss two questions: 1) How many nodes should we use for a given problem before communication becomes a bottleneck? and 2) How often should the nodes communicate with one another for the communication cost to be worth the transmission? In addition, we discuss some more practical issues that one needs to consider in implementing algorithms that are asynchronous and robust to communication delays.
  •  
  •  NEWS   DCC 2013: publication by Petros T. Boufounos and Shantanu D. Rane
    Date: March 20, 2013
    Where: Data Compression Conference (DCC)
    MERL Contact: Petros Boufounos
    Research Areas: Multimedia, Computational Sensing
    Brief
    • The paper "Efficient Coding of Signal Distances Using Universal Quantized Embeddings" by Boufounos, P.T. and Rane, S. was presented at the Data Compression Conference (DCC)
  •  
  •  NEWS   Journal of Machine Learning Research (JMLR): publication by Petros T. Boufounos and others
    Date: March 1, 2013
    Where: Journal of Machine Learning Research (JMLR)
    MERL Contact: Petros Boufounos
    Research Areas: Multimedia, Computational Sensing
    Brief
    • The article "Greedy Sparsity-Constrained Optimization" by Bahmani, S., Raj, B. and Boufounos, P. was published in Journal of Machine Learning Research (JMLR)
  •  
  •  NEWS   IEEE Signal Processing Letters: publication by Jonathan Le Roux and others
    Date: March 1, 2013
    Where: IEEE Signal Processing Letters
    MERL Contact: Jonathan Le Roux
    Research Areas: Multimedia, Speech & Audio
    Brief
    • The article "Consistent Wiener Filtering for Audio Source Separation" by Le Roux, J. and Vincent, E. was published in IEEE Signal Processing Letters
  •  
  •  TALK   Probabilistic Latent Tensor Factorisation
    Date & Time: Tuesday, February 26, 2013; 12:00 PM
    Speaker: Prof. Taylan Cemgil, Bogazici University, Istanbul, Turkey
    MERL Host: Jonathan Le Roux
    Research Areas: Multimedia, Speech & Audio
    Brief
    • Algorithms for decompositions of matrices are of central importance in machine learning, signal processing and information retrieval, with SVD and NMF (Nonnegative Matrix Factorisation) being the most widely used examples. Probabilistic interpretations of matrix factorisation models are also well known and are useful in many applications (Salakhutdinov and Mnih 2008; Cemgil 2009; Fevotte et al. 2009). In recent years, decompositions of multiway arrays, known as tensor factorisations, have gained significant popularity for the analysis of large data sets with more than two entities (Kolda and Bader, 2009; Cichocki et al. 2008). We will discuss a subset of these models from a statistical modelling perspective, building upon probabilistic Bayesian generative models and generalised linear models (McCullagh and Nelder). In both views, the factorisation is implicit in a well-defined hierarchical statistical model and factorisations can be computed via maximum likelihood.

      We express a tensor factorisation model using a factor graph and the factor tensors are optimised iteratively. In each iteration, the update equation can be implemented by a message passing algorithm, reminiscent of variable elimination in a discrete graphical model. This setting provides a structured and efficient approach that enables very easy development of application-specific custom models, as well as algorithms for the so-called coupled (collective) factorisations where an arbitrary set of tensors is factorised simultaneously with shared factors. Extensions to full Bayesian inference for model selection, via variational approximations or MCMC, are also feasible. Well-known models of multiway analysis such as Nonnegative Matrix Factorisation (NMF), Parafac, Tucker, and audio processing (Convolutive NMF, NMF2D, SF-SSNTF) appear as special cases and new extensions can easily be developed. We will illustrate the approach with applications in link prediction and audio and music processing.
  •  
  •  NEWS   IEEE Signal Processing Magazine: 2 publications by Petros T. Boufounos and Shantanu D. Rane
    Date: February 13, 2013
    Where: IEEE Signal Processing Magazine
    MERL Contact: Petros Boufounos
    Research Area: Multimedia
    Brief
    • The articles "Privacy-Preserving Nearest Neighbor Methods: Comparing Signals without Revealing Them" by Rane, S. and Boufounos, P.T. and "Privacy-preserving Speech Processing: Cryptographic and String-Matching Frameworks Show Promise" by Pathak, M.A., Raj, B., Rane, S. and Smaragdis, P. were published in IEEE Signal Processing Magazine
  •  
  •  TALK   Bayesian Group Sparse Learning
    Date & Time: Monday, January 28, 2013; 11:00 AM
    Speaker: Prof. Jen-Tzung Chien, National Chiao Tung University, Taiwan
    Research Areas: Multimedia, Speech & Audio
    Brief
    • Bayesian learning provides attractive tools to model, analyze, search, recognize and understand real-world data. In this talk, I will introduce a new Bayesian group sparse learning approach and its application to speech recognition and signal separation. First, I will present the group sparse hidden Markov models (GS-HMMs), where a sequence of acoustic features is driven by a Markov chain and each feature vector is represented by two groups of basis vectors. The features across states and within states are represented accordingly. The sparse prior is imposed by introducing the Laplacian scale mixture (LSM) distribution. The robustness of speech recognition is illustrated. In addition, the LSM distribution is incorporated into Bayesian group sparse learning based on nonnegative matrix factorization (NMF). This approach is developed to estimate the reconstructed rhythmic and harmonic music signals from a single-channel source signal. A Monte Carlo procedure is presented to infer the two groups of parameters. Future work on Bayesian learning will be discussed.
  •  
  •  NEWS   IEEE Transactions on Information Theory: publication by Petros T. Boufounos and others
    Date: January 23, 2013
    Where: IEEE Transactions on Information Theory
    MERL Contact: Petros Boufounos
    Research Areas: Multimedia, Computational Sensing
    Brief
    • The article "Robust 1-Bit Compressive Sensing via Binary Stable Embeddings of Sparse Vectors" by Jacques, L., Laska, J.N., Boufounos, P.T. and Baraniuk, R.G. was published in IEEE Transactions on Information Theory
  •  
  •  TALK   Electromagnetic Remote Sensing for the Detection of Concealed Objects
    Date & Time: Thursday, December 13, 2012; 12:00 PM
    Speaker: Dr. Tomasz M. Grzegorczyk, Delpsi LLC
    MERL Host: Anthony Vetro
    Research Area: Multimedia
    Brief
    • Electromagnetic (EM) remote sensing is a well-established modality for the detection, tracking, and identification of concealed targets. The degree of freedom offered by the operating frequency (and the associated propagation or induction regimes) makes EM waves sufficiently versatile to interrogate both large and small structures, metallic as well as dielectric objects, in close proximity or farther away. This wide flexibility has made EM remote sensing a modality of choice in many applications. This presentation will focus on two implementations of non-destructive and non-contact EM sensing. The first is based on a tomographic approach, whereby EM waves are used to infer material properties within the volume of accessible structures. The two examples to be discussed are breast cancer detection, i.e. locating areas of high vascularity in otherwise healthy biological tissues, and inspection of concrete structures, i.e. identifying volumetric material property variations to locate rebars and cracks. The second area we will discuss is that of subsurface target detection, with again two very different applications. The first pertains to ground penetrating radars with frequencies in the GHz range aimed at the detection of buried weak dielectric scatterers, whereas the second focuses on the detection of metallic targets in the magnetic induction regime, for which much lower frequencies are used. In all these applications, the data collected by the appropriate hardware are processed by combining fundamental EM concepts with inverse methods for parameter estimation. We will discuss both a deterministic method (Gauss-Newton) and a stochastic method (Kalman filters) for real-time target detection.
  •  
  •  TALK   Speech recognition for closed-captioning
    Date & Time: Tuesday, December 11, 2012; 12:00 PM
    Speaker: Takahiro Oku, NHK Science & Technology Research Laboratories
    Research Areas: Multimedia, Speech & Audio
    Brief
    • In this talk, I will present human-friendly broadcasting research conducted at NHK and research on speech recognition for real-time closed-captioning. The goal of human-friendly broadcasting research is to make broadcasting more accessible and enjoyable for everyone, including children, the elderly, and physically challenged persons. The automatic speech recognition technology that NHK has developed makes it possible to create captions for the hearing impaired automatically in real time. For sports programs such as professional sumo wrestling, a closed-captioning system has already been implemented in which captions are created by using speech recognition on a captioning re-speaker. In 2011, NHK General Television started broadcasting closed captions for the information program "Morning Market". After introducing the implemented closed-captioning system, I will talk about a recent improvement obtained by an adaptation method that creates a more effective acoustic model using error correction results. The method reflects recognition error tendencies more effectively.
  •  
  •  NEWS   APSIPA Transactions on Signal and Information Processing: publication by Shinji Watanabe and others
    Date: December 6, 2012
    Where: APSIPA Transactions on Signal and Information Processing
    Research Areas: Multimedia, Speech & Audio
    Brief
    • The article "Bayesian Approaches to Acoustic Modeling: A Review" by Watanabe, S. and Nakamura, A. was published in APSIPA Transactions on Signal and Information Processing
  •  
  •  NEWS   Asia-Pacific Signal & Information Processing Association Annual Summit and Conference 2012: 2 publications by Anthony Vetro, Huifang Sun, Robert A. Cohen and Dong Tian
    Date: December 3, 2012
    Where: Asia-Pacific Signal & Information Processing Association Annual Summit and Conference
    MERL Contacts: Dong Tian; Robert Cohen; Anthony Vetro; Huifang Sun
    Research Area: Multimedia
    Brief
    • The papers "Depth Map Up-sampling Based on Edge Layers" by Graziosi, D.B., Tian, D. and Vetro, A. and "Joint Perceptually-based Intra Prediction and Quantization for HEVC" by Jin, G., Cohen, R., Vetro, A. and Sun, H. were presented at the Asia-Pacific Signal & Information Processing Association Annual Summit and Conference
  •  
  •  EVENT   APSIPA 2012
    Date: Monday, December 3, 2012 - Thursday, December 6, 2012
    MERL Contact: Anthony Vetro
    Location: Hollywood, CA
    Research Area: Multimedia
    Brief
    • MERL is a sponsor for APSIPA 2012, the fourth annual conference organized by the Asia-Pacific Signal and Information Processing Association.
  •