An important application scenario in nltk snowball extraction of stem Machine learning is machine automatic classification, and the key to classification is stem extraction. So we need to use snowball. The following describes two methods for extracting the stem from snowball.
Two methods:
Method 1:
>>> From nltk import SnowballStemmer
>>> SnowballStemmer. supported ages # See which supported ages are supported
('Danish', 'utch', 'English ', 'Finnish', 'French ', 'German', 'Hungarian ',
'Italian ', 'norwegianc', 'porter', 'Portuguese ", 'Romanian ',
'Russian ', 'spance', 'Swedish ')
>>> Stemmer = SnowballStemmer ("german") # Choose a language
>>> Stemmer. stem (u "Autobahnen") # Stem a word
U'autobahn'
However, when you know your language scenario, you can use the following method to call it directly:
Method 2:
>>> Ps = nltk. stem. snowball. PortugueseStemmer ()
>>> Ps. stem ('celular ')
U'celul'
>>> Ps. stem ('celular ')
U'celul'