2019 Spring Internship Program:

A Spoken Dialogue System Enabling Various Intention Levels of Information Seeking Behaviors

Term: February – May 2019 (the schedule is negotiable)

Place: Perceptual Computing Laboratory, Waseda University, Tokyo


The Perceptual Computing Laboratory at Waseda University is seeking several spring research interns, starting from February 2019 (the schedule is negotiable), to work on an incremental and adaptive spoken dialogue system that enables a whole new way of creating, distributing, and consuming information in the coming decades. Conversational information processing differs fundamentally from conventional interactive media such as web services. Conversational media rely on users’ explicit and implicit “pull” and “push” requests to retrieve an appropriate amount of information in a natural way. A long Wikipedia article, for example, is not optimized for consumption through a conversational user interface: such written documents need to be fragmented and translated into spoken form as pre-compiled utterance plans, which must then be controlled on the fly as they are consumed.

We propose a spoken dialogue system that supports various intention levels of information-seeking behavior. The proposed system consists mainly of an utterance planner and a dialogue manager. Given a set of news articles, the utterance planner pre-compiles an utterance plan that consists of a primary plan for delivering the main news content and associated subsidiary plans for supplementing that content. The primary plan is generated automatically by applying text summarization and style conversion techniques. The subsidiary plans are compiled by considering potential user/system interactions. Using the utterance plans, the dialogue manager first classifies the user’s possible passive/active behaviors and then selects the corresponding utterances. We empirically confirmed that our system was able to deliver the news content smoothly while dynamically adapting to changes in the user’s intention level.

Automatic pre-compiled utterance plan generation process
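
The split between a pre-compiled utterance plan and a dialogue manager described above can be illustrated with a minimal Python sketch. It is not the laboratory's implementation: the names `UtterancePlan`, `classify_behavior`, and `DialogueManager`, the keyword-based lookup, and the sample news text are all hypothetical placeholders, and a simple backchannel check stands in for the real behavior classifier.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class UtterancePlan:
    """Hypothetical container: one primary plan plus its subsidiary plans."""
    primary: list[str]
    # Maps a primary-utterance index to {trigger keyword: supplementary utterance}.
    subsidiary: dict[int, dict[str, str]] = field(default_factory=dict)


def classify_behavior(user_input: str) -> str:
    """Toy stand-in for behavior classification: returns 'passive' or 'active'."""
    text = user_input.strip().lower()
    backchannels = {"", "uh-huh", "ok", "i see", "yes"}
    return "passive" if text in backchannels else "active"


class DialogueManager:
    """Selects the next utterance from the plan based on the user's behavior."""

    def __init__(self, plan: UtterancePlan) -> None:
        self.plan = plan
        self.position = 0  # index of the next primary utterance to deliver

    def respond(self, user_input: str) -> str:
        if classify_behavior(user_input) == "active":
            # Active behavior: look for a subsidiary utterance attached to
            # the most recently delivered primary utterance.
            current = max(self.position - 1, 0)
            candidates = self.plan.subsidiary.get(current, {})
            for keyword, answer in candidates.items():
                if keyword in user_input.lower():
                    return answer
            return "Sorry, I do not have more detail on that."
        # Passive behavior: continue with the primary plan.
        if self.position >= len(self.plan.primary):
            return "That is all for this article."
        utterance = self.plan.primary[self.position]
        self.position += 1
        return utterance


# Made-up example plan, for illustration only.
plan = UtterancePlan(
    primary=[
        "A new rail line opened today.",
        "It connects two major stations.",
    ],
    subsidiary={0: {"where": "The line runs through the city center."}},
)

manager = DialogueManager(plan)
print(manager.respond(""))              # passive: first primary utterance
print(manager.respond("Where is it?"))  # active: matching subsidiary utterance
print(manager.respond("ok"))            # passive: second primary utterance
```

Keeping the plan as plain data and the selection logic in a separate class mirrors the planner/manager division in the description; in a real system the keyword lookup would presumably be replaced by question answering over the source documents.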

Goals of the Internship Program

The group has been developing a Japanese version of the proposed spoken dialogue system for years and plans to extend its functionality towards a large-scale real-world application in collaboration with corporate partners. The interns are expected to design and implement an English version of the proposed system based on the original Japanese version and to conduct further user studies under the guidance of senior researchers. The project is therefore ideal for students who want to gain experience in both fundamental research and end-to-end implementation of spoken dialogue systems. More specifically, the internship program has two major goals:

  1. Document summarization and text translation from written documents into spoken utterances (for primary plans)
  2. Non-factoid-based question answering related to the current documents (for subsidiary plans)

The interns can also explore new functionality for more natural conversational experiences based on their own ideas. The completed work is expected to be published at major conferences and in major journals, such as ACL, SIGDIAL, and IEEE/ACM Transactions on Audio, Speech, and Language Processing. The interns will receive intensive advice and guidance from multiple faculty mentors in the group with a range of backgrounds (machine learning, speech and language processing, and dialogue systems). Although the original reference system was implemented in Japanese, the mentors will provide the interns with the necessary information in English.

Demo video (early Japanese version):


Minimum Education and Competencies:
Candidates should have fundamental knowledge of and hands-on experience in the following fields:

  1. Machine learning
  2. Speech and language processing
  3. (Spoken) dialogue systems
  4. English communication skills in written and oral forms

Desirable Competencies:

  1. Speech synthesis and/or recognition
  2. Document summarization
  3. Question-answering
  4. Machine translation

Financial Support

Waseda University will cover the following costs for accepted interns:

  • Round-trip economy class air ticket
  • Allowance, transportation, and accommodation expenses: basic support according to Waseda University regulations
  • Travel insurance

About the Perceptual Computing Laboratory and Waseda University

The Perceptual Computing Laboratory at Waseda University, led by Professor Tetsunori Kobayashi, investigates theoretical machine learning and pattern recognition and their integrated applications, such as conversational robots and intelligent interfaces, as well as their development paradigms. The group consists of more than 10 faculty members (professors, associate professors, and assistant professors) with strong backgrounds in machine learning, pattern recognition, natural language processing, dialogue systems, human-robot interaction, the Internet of Things, and crowdsourcing. Waseda University is a private, independent research university in central Tokyo. Since 1882, Waseda has tirelessly challenged convention in favor of progress and innovation. Waseda is particularly well known as an international-student-friendly university that hosts the largest number of foreign students in Japan.


Interested applicants should send a CV and cover letter by email to the following address:

Yoichi Matsuyama

Associate Research Professor
Perceptual Computing Laboratory
Waseda University, Green Computing System R&D Center 27 Waseda, Room 40-701,
Shinjuku-ku, Tokyo, 162-0042, Japan
