@Paolo Messina yes that’s quite possible. However, it all depends on the features and the data itself. If the dataset used does not provide enough vocabulary information about a language, then its possible to confuse the classifier. So should the performance degrade significantly, there are two fixes for this. (1) Look into the quality of the input data and (2) improve qualit of features (i.e. improve the predictive power of the classifier) .