Winograd T. (1973) A Procedural Model of Language Understanding. In: Grosz B., Spärck Jones K., Webber B. (eds) Readings in Natural Language Processing. Morgan Kaufmann, San Francisco
Spärck Jones K. (1994) Natural Language Processing: A Historical Review. In: Zampolli A., Calzolari N., Palmer M. (eds) Current Issues in Computational Linguistics: In Honour of Don Walker. Linguistica Computazionale, vol 9. Springer, Dordrecht
There are plenty of descriptions of how concepts of information theory apply (or don’t) to language, but I personally find George Miller’s essay from 1953 to be extremely readable and helpful: