Natural Language Processing with Python Updated EditionChapter 44

Practical Applications

Section 8 of 8-~ 12 min read-Synced from Cuantum content
  1. Using which function from scikit-learn can you transform text data into a Bag of Words representation?
  2. a) TfidfVectorizer

b) CountVectorizer

c) Word2Vec

d) BertTokenizer

  1. Which method from Gensim can you use to load pre-trained GloVe embeddings?
  2. a) gensim.load_glove()

b) api.load()

c) glove.load()

d) gensim.download_glove()

  1. What is the primary purpose of stop word removal in text preprocessing?
  2. a) To increase the length of text data.

b) To remove irrelevant or less informative words.

c) To convert text to uppercase.

d) To add punctuation to the text.