This is the link to the dataset.
Please cite the following two papers if you are using this dataset.
1. Rouzbeh A. Shirvani, Mario Piergallini, Gauri S. Gautam and Mohamed Chouikha, “Word-Level Language Identification and Predicting Codeswitching Points in Swahili-English Language Data”, Conference on Empirical Methods in Natural Language Processing, Austin, Texas, USA, November 2016.
2. Mario Piergallini, Rouzbeh A. Shirvani, Gauri S. Gautam and Mohamed Chouikha, “System submission for language identification in Spanish-English Codeswitching”, Conference on Empirical Methods in Natural Language Processing, Austin, Texas, USA, November 2016.