TR2014-024

Log-linear dialog manager

- Tang, H., Watanabe, S., Marks, T.K., Hershey, J.R., "Log-linear Dialog Manager", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), DOI: 10.1109/ICASSP.2014.6854371, May 2014, pp. 4092-4096.
  BibTeX TR2014-024 PDF
  - @inproceedings{Tang2014may,
  - author = {Tang, H. and Watanabe, S. and Marks, T.K. and Hershey, J.R.},
  - title = {{Log-linear Dialog Manager}},
  - booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
  - year = 2014,
  - pages = {4092--4096},
  - month = may,
  - publisher = {IEEE},
  - doi = {10.1109/ICASSP.2014.6854371},
  - url = {https://www.merl.com/publications/TR2014-024}
  - }
MERL Contact:
- Tim K.
  Marks
Research Areas:

Artificial Intelligence, Speech & Audio

Abstract:

We design a log-linear probabilistic model for solving the dialog management task. In both planning and learning we optimize the same objective function: the expected reward. Rather than performing full policy optimization, we perform on-line estimation of the optimal action as a belief-propagation inference step. We employ context-free grammars to describe our variable spaces, which enables us to define rich features. To scale our approach to large variable spaces, we use particle belief propagation. Experiments show that the model is able to choose system actions that yield a high expected reward, outperforming its POMDP-like log-linear counterpart and a hand-crafted rule-based system.

MERL Contact:

Tim K.Marks

Research Areas:

Abstract:

Tim K.
Marks