Written with a traditional "starting from automatic speech recognition" viewpoint, you will need to make the connections for yourself to the more general concept of text-to-speech as a regression problem.
in Paul Taylor “Text-to-speech synthesis”, 2009, Cambridge University Press, Cambridge, ISBN 0521899273
Note that the terminology “context-sensitive modelling” used in this chapter means exactly the same as “context-dependent modelling” in other work.