Response to Speech Synthesis feedback of 2019-02-12

This topic has 5 replies, 1 voice, and was last updated 6 years, 3 months ago by Simon.

Viewing 5 reply threads

Author

Posts
- February 14, 2019 at 14:44 #9688
  Simon
  Professor
  Thank you for the useful feedback (from 25 responses), regarding the Speech Synthesis course. I have clustered your comments, and my responses are below.
  
  I have briefly summarised the most frequently mentioned positive points without adding any response, then responded to each negative point made by at least 2 people. Please do feel free to follow up and offer more feedback or clarification.
- February 14, 2019 at 14:54 #9689
  Simon
  Professor
  Positive comments
  
  Number of people mentioning each point is given in parentheses.
  
  Group work / interactive classes (13)
  
  The videos (8) and subtitles / transcripts (3)
  
  Flipped classroom format (7)
  
  Labs (6) and specifically the tutor (2)
  
  Milestones for the assignment (4)
  
  speech.zone in general, including content, navigation (4)
- February 14, 2019 at 14:57 #9690
  Simon
  Professor
  Lab tasks for each week could be clearer / labs could be more structured
  
  We will provide more class-wide instructions during the remaining lab sessions, whilst still leaving plenty of time for individual help.
- February 14, 2019 at 15:00 #9691
  Simon
  Professor
  More instructions on shell scripting / why can’t we use Python
  
  The voice building ‘recipe’ we are using is written as shell or Scheme scripts, and it’s not easy to change that. Learning more shell scripting (in addition to what was covered in Speech Processing) is an important aspect of the course.
  
  There some shell scripting help on the forums and I am always happy to add to that, in response to specific questions and requests.
  
  You can use Python for everything you implement yourself, such as a text selection algorithm.
- February 14, 2019 at 15:02 #9692
  Simon
  Professor
  Could the assignment be done in pairs or groups?
  
  I’m not a fan of pair or group work, where a single report is submitted and therefore all members of the group receive the same mark. My experience with observing this in other courses is that some students do a lot more work than others.
  
  You can of course work together in the lab, discussing the theory and practice of speech synthesis, talking about what limited domain you might use, what kinds of hypotheses make sense, or which tool to use to implement a listening test, etc. But you must then execute the work yourself.
- February 14, 2019 at 15:05 #9693
  Simon
  Professor
  The course should have more material on neural networks and state-of-the-art methods such as Wavenet, right from the start
  
  The course will cover all of these things, don’t worry. But it makes no sense to jump into state-of-the-art models until the foundations are solid, so that we can motivate why particular approaches are the state-of-the-art.
  
  In other courses that I am familiar with, the state-of-the-art is covered at the start of the course, but then I find students lack the foundations and simply don’t really understand the material. They then have to backtrack and learn the foundations that are needed, sometimes on their own.
  
  As it happens, Wavenet, in its original text-to-waveform configuration, was only very briefly the state-of-the-art and has already been superseded That’s another reason to do the state-of-the-art at the end of the course, so we can include the very latest approaches from this fast-moving field.
Author

Posts

Viewing 5 reply threads

You must be logged in to reply to this topic.

Response to Speech Synthesis feedback of 2019-02-12

Search the forums

Note

Latest Activity

Search the forums

Speech Synthesis