Links to data used in Sproat & Jaitly (https://arxiv.org/abs/1611.00068) experiments, with data for Polish added by request.
The data is all on Google storage:
https://storage.googleapis.com/text-normalization/README
https://storage.googleapis.com/text-normalization/LICENSE
https://storage.googleapis.com/text-normalization/CONTRIBUTING
https://storage.googleapis.com/text-normalization/en_with_types.tgz
https://storage.googleapis.com/text-normalization/ru_with_types.tgz
https://storage.googleapis.com/text-normalization/pl_with_types.tgz