Data for download
Downloadable files with all the anglicisms extracted by Lazaro
Lazaro's extraction model improves with time. Consequently, the more recent the data, the more reliable it is. The anglicisms in these files were extracted automatically and haven't been reviewed in detail. As a result, they might contain errors.
The code of the extraction model and the training corpus are available in the GitHub repository