Natural Language Understanding has widespread applications in today’s world. It is being used to detect user sentiments, extract form-based information from text data, summarize documents, etc. All of these applications, however, are possible only when we can encode words into numerical representations, because machines only understand numbers and not text. We can project words into a vector space or hidden space (hidden because we don’t know how to interpret these features) where words with similar meanings have similar vectors. These vectors are also called the Word Embeddings for that particular word. The major advances in Deep Learning for difficult Natural Language Processing tasks would not have been possible without this method of representing words and documents.
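To make "similar meanings have similar vectors" concrete, here is a minimal sketch using cosine similarity, a common way to compare embedding vectors. The three-dimensional vectors below are hand-picked, hypothetical values used only for illustration; they are not learned from the review data.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# Hypothetical 3-dimensional embeddings, hand-picked for illustration only
toy_embeddings = {
    "good":  [0.9, 0.1, 0.3],
    "great": [0.8, 0.2, 0.4],
    "poor":  [-0.7, 0.6, 0.1],
}

sim_good_great = cosine_similarity(toy_embeddings["good"], toy_embeddings["great"])
sim_good_poor = cosine_similarity(toy_embeddings["good"], toy_embeddings["poor"])
print(f"good vs great: {sim_good_great:.3f}")
print(f"good vs poor:  {sim_good_poor:.3f}")
```

With real embeddings the vectors typically have tens or hundreds of dimensions, but the comparison works the same way.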
In this case study, we will predict the rating of an app from its reviews, using the NLP techniques we have learnt so far. The ratings are grouped into 3 classes: good, neutral and poor. The data set was scraped from the Google Play Store. This notebook covers the different classifier models that we use for prediction and the task of finding which one gives the highest accuracy.
The data contains over 12,000 reviews of different app store applications by real users. The data also contains the rating given by each user, so every review carries one of the three labels: good, neutral or poor.
#!pip install tqdm
from tqdm import tqdm
import numpy as np
import pandas as pd
# To visualise the graphs
import matplotlib.pyplot as plt
import seaborn as sns
# Helps to extract the data using regular expressions
import re
# Helps to visualize the wordcloud
!pip install wordcloud
from wordcloud import WordCloud, STOPWORDS, ImageColorGenerator
!pip install gensim
from gensim.models import Word2Vec
import nltk
from nltk.corpus import stopwords
# Used in Tokenization
from nltk.tokenize import word_tokenize, sent_tokenize
from sklearn.model_selection import train_test_split
# Used to encode the categorical variables
from sklearn.preprocessing import LabelEncoder
# Importing the Random Forest model
from sklearn.ensemble import RandomForestClassifier
# Metrics to evaluate the model
from sklearn.metrics import accuracy_score,classification_report
import warnings
# Ignore all warnings
warnings.filterwarnings("ignore")
# Loading the dataset into a dataframe
df = pd.read_csv("3.2+new_reviews.csv")
df.head(5)
| | content | label |
|---|---|---|
| 0 | I cannot open the app anymore | poor |
| 1 | I have been begging for a refund from this app… | poor |
| 2 | Very costly for the premium version (approx In… | poor |
| 3 | Used to keep me organized, but all the 2020 UP… | poor |
| 4 | Dan Birthday Oct 28 | poor |
#Checking the info of the data
df.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 12495 entries, 0 to 12494
Data columns (total 2 columns):
 #   Column   Non-Null Count  Dtype
---  ------   --------------  -----
 0   content  12495 non-null  object
 1   label    12495 non-null  object
dtypes: object(2)
memory usage: 195.4+ KB
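Observations:
The data has 12,495 rows and 2 columns, content and label. Neither column has missing values, and both are of object (string) type.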
#Check duplicate entries
df[df['content'].duplicated()]['content']
83          Very bad
327         Not free
435             good
524            Super
583             Good
            ...
12468    Very useful
12472      Excellent
12473        love it
12476           good
12485           good
Name: content, Length: 688, dtype: object
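Observations:
There are 688 rows whose content repeats an earlier review. The ones shown are short, generic reviews such as "good", "Very bad" and "Excellent".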
Now, our main focus will be to build the word embeddings for the text data.
So, let’s consider only the content column along with the target variable label.
data = df.copy()
# Let's view the count of each class
sns.countplot(x='label', data=data)
<AxesSubplot:xlabel='label', ylabel='count'>
Observations:
# Let's look at the 150 most frequent words in the original feedback from the users
all_texts = " ".join(texts for texts in data.content)
# len() on a string counts characters, not words
print("There are {} characters in the combination of all texts.".format(len(all_texts)))
# Lower max_font_size, change the maximum number of words and lighten the background:
wordcloud = WordCloud(max_font_size = 40, max_words = 150, background_color="white").generate(all_texts)
plt.figure(figsize = (8,16))
plt.imshow(wordcloud)
plt.axis("off")
plt.show()
There are 1858712 characters in the combination of all texts.
Observation:
Let’s now clean the data before going on to build the word embeddings for the text.
nltk.download('stopwords')
[nltk_data] Downloading package stopwords to C:\Users\Sanjay Kumar
[nltk_data]     CJ\AppData\Roaming\nltk_data...
[nltk_data]   Package stopwords is already up-to-date!
True
Regular Expressions
A regular expression matches a broad or specific text pattern, and is strictly read left-to-right.
Let’s apply the expression on a single data point, and see the results:
data['content'][0:10]
0    I cannot open the app anymore
1    I have been begging for a refund from this app...
2    Very costly for the premium version (approx In...
3    Used to keep me organized, but all the 2020 UP...
4    Dan Birthday Oct 28
5    It has changed how I viewed my different lists...
6    I'm only looking for a grocery list app but ev...
7    Reset my free trial, new phone I'd like to see...
8    How do to stop monthly payment because i don't...
9    I complain about not crashes and it was immedi...
Name: content, dtype: object
print("Before applying the expression :", data['content'][3])
pattern = '[^a-zA-Z0-9]'
res = re.sub(pattern, ' ', data['content'][3])
print("After applying the expression :", res)
Before applying the expression : Used to keep me organized, but all the 2020 UPDATES have made a mess of things !!! Y cudn't u leave well enuf alone ??? Guess ur techies feel the need to keep making changes to justify continuing to collect their salary !!! 🤤🤤🤤
After applying the expression : Used to keep me organized but all the 2020 UPDATES have made a mess of things Y cudn t u leave well enuf alone Guess ur techies feel the need to keep making changes to justify continuing to collect their salary
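We can clearly observe that the special characters (punctuation, apostrophes and emoji) have been removed from the text, while letters and digits such as 2020 are kept, because the pattern only matches characters outside a-z, A-Z and 0-9. Each removed character is replaced by a space, so there are now extra spaces in many places in the text. Let’s remove these using another regular expression.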
print("Before applying the expression :", res)
pattern = r'\s+'
resf = re.sub(pattern, ' ', res)
print("After applying the expression :", resf)
Before applying the expression : Used to keep me organized but all the 2020 UPDATES have made a mess of things Y cudn t u leave well enuf alone Guess ur techies feel the need to keep making changes to justify continuing to collect their salary
After applying the expression : Used to keep me organized but all the 2020 UPDATES have made a mess of things Y cudn t u leave well enuf alone Guess ur techies feel the need to keep making changes to justify continuing to collect their salary
Let’s remove the numbers and other numerals from the data. The range [0-9] is not always equivalent to \d. In Python 3, [0-9] matches only the characters 0123456789, while \d (in a str pattern, without the re.ASCII flag) matches [0-9] as well as other Unicode digit characters, for example the Eastern Arabic numerals ٠١٢٣٤٥٦٧٨٩.
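The difference between [0-9] and \d can be checked directly. The sketch below uses a made-up sample string; the Eastern Arabic digits are written as Unicode escapes (U+0662 is the digit 2, U+0660 is the digit 0) to avoid any rendering issues.

```python
import re

# Made-up sample: ASCII "2020" followed by the same number in Eastern Arabic digits
sample = "order 2020 and \u0662\u0660\u0662\u0660"

# [0-9] removes only the ASCII digits
ascii_only = re.sub(r'[0-9]', '', sample)

# \d in a str pattern also removes other Unicode decimal digits
unicode_digits = re.sub(r'\d', '', sample)

# The re.ASCII flag restricts \d to [0-9] again
ascii_flag = re.sub(r'\d', '', sample, flags=re.ASCII)

print(repr(ascii_only))
print(repr(unicode_digits))
print(repr(ascii_flag))
```

For English-only reviews the two patterns usually behave the same; the distinction matters when the text may contain digits from other scripts.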
print("Before applying the expression :", resf)
pattern = r'\d+'
resl = re.sub(pattern, ' ', resf)
print("After applying the expression :", resl)
Before applying the expression : Used to keep me organized but all the 2020 UPDATES have made a mess of things Y cudn t u leave well enuf alone Guess ur techies feel the need to keep making changes to justify continuing to collect their salary
After applying the expression : Used to keep me organized but all the UPDATES have made a mess of things Y cudn t u leave well enuf alone Guess ur techies feel the need to keep making changes to justify continuing to collect their salary
# Build the stopword set once instead of reloading the list for every word
stop_words = set(stopwords.words('english'))
final_corpus = []
for i in range(data.shape[0]):
    # Replace everything except letters and digits with a space
    text = re.sub('[^a-zA-Z0-9]', ' ', data['content'][i])
    # Lowercase the text
    text = text.lower()
    # Collapse runs of whitespace into a single space
    text = re.sub(r'\s+', ' ', text)
    # Remove the digits
    text = re.sub(r'\d', ' ', text)
    # Split the text into words (this also discards leftover extra spaces)
    text = text.split()
    # Remove the stopwords
    text = [word for word in text if word not in stop_words]
    # Join the words back into text
    text = ' '.join(text)
    # Append the cleaned text to the final_corpus list
    final_corpus.append(text)
data['final_text'] = final_corpus
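The same cleaning steps can also be wrapped in a small reusable function. This is a minimal sketch, not the notebook's own code: clean_text and STOP_WORDS are hypothetical names, and the tiny stop-word set is an assumption standing in for nltk's stopwords.words('english') so that the example runs without downloading anything. The single pattern [^a-zA-Z] gives the same result as removing special characters first and digits afterwards.

```python
import re

# Small illustrative stop-word set (an assumption); the notebook uses
# nltk's stopwords.words('english') instead
STOP_WORDS = frozenset(
    {"i", "the", "a", "an", "to", "of", "and", "have", "me", "all", "but"}
)

def clean_text(text, stop_words=STOP_WORDS):
    """Keep letters only, lowercase, drop stop words, normalise spaces."""
    text = re.sub(r'[^a-zA-Z]', ' ', text)   # drops digits, punctuation, emoji
    tokens = text.lower().split()            # split() also collapses whitespace
    return ' '.join(t for t in tokens if t not in stop_words)

sample = "Used to keep me organized, but all the 2020 UPDATES have made a mess !!!"
cleaned = clean_text(sample)
print(cleaned)
```

With pandas, such a function could be applied to the whole column in one step, for example with data['content'].apply(clean_text).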
Let’s look at the 150 most frequent words in the feedback after applying the preprocessing steps:
all_texts1 = " ".join(texts for texts in data.final_text)
# len() on a string counts characters, not words
print("There are {} characters in the combination of all texts.".format(len(all_texts1)))
# Lower max_font_size, change the maximum number of words and lighten the background:
wordcloud = WordCloud(max_font_size = 40, max_words = 150, background_color="white").generate(all_texts1)
plt.figure(figsize = (8,16))
plt.imshow(wordcloud)
plt.axis("off")
plt.show()
There are 1155388 characters in the combination of all texts.
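Calling len() on a joined string returns the number of characters, so the figure printed above is a character count. A rough word count can be obtained by splitting on whitespace first. The sketch below shows the difference on two short, hypothetical reviews.

```python
# Two short hypothetical reviews, for illustration only
reviews = ["good app", "good value"]
joined = " ".join(reviews)

n_chars = len(joined)                  # characters, spaces included
n_words = len(joined.split())          # whitespace-separated tokens
n_unique = len(set(joined.split()))    # distinct tokens

print(n_chars, n_words, n_unique)
```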
Word embeddings are a type of word representation in which each word is represented by a dense, fixed-length vector of real numbers. The representations are such that words with similar meanings or similar semantics have similar vectors. Words are mapped to these vectors using various ML methods such as neural networks, probabilistic models, or dimensionality reduction on the word co-occurrence matrix. Some important word embedding models are Word2vec (Google), GloVe (Stanford), and FastText (Facebook).
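To illustrate what a word co-occurrence matrix records, the sketch below counts how often two words appear within a fixed window of each other. It is a simplified illustration under stated assumptions: cooccurrence_counts is a hypothetical helper, the tokenised sentences are made up, and no dimensionality reduction is applied. Count-based embedding methods start from statistics of this kind.

```python
from collections import Counter

def cooccurrence_counts(sentences, window=2):
    """Count how often each ordered word pair appears within `window` tokens."""
    counts = Counter()
    for tokens in sentences:
        for i, word in enumerate(tokens):
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    counts[(word, tokens[j])] += 1
    return counts

# Hypothetical tokenised reviews
sentences = [["cannot", "open", "app"], ["open", "app", "daily"]]
counts = cooccurrence_counts(sentences, window=1)
print(counts[("open", "app")])
print(counts[("cannot", "app")])
```

Each word's row of counts can be read as a sparse vector; words that occur in similar contexts end up with similar rows.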
In this notebook, we are going to look at the Word2vec and GloVe implementations of word embeddings.
words_list = []
for i in data['final_text']:
    # split() without arguments avoids empty-string tokens for empty reviews
    li = i.split()
    words_list.append(li)
# Let's have a look into words_list
words_list[0:5]
[['cannot', 'open', 'app', 'anymore'],
 ['begging', 'refund', 'app', 'month', 'nobody', 'replying'],
 ['costly', 'premium', 'version', 'approx', 'indian', 'rupees', 'per', 'year', 'better', 'download', 'premium', 'version', 'app', 'apkmos', 'website', 'use', 'microsoft', 'list', 'app', 'far', 'better'],
 ['used', 'keep', 'organized', 'updates', 'made', 'mess', 'things', 'cudn', 'u', 'leave', 'well', 'enuf', 'alone', 'guess', 'ur', 'techies', 'feel', 'need', 'keep', 'making', 'changes', 'justify', 'continuing', 'collect', 'salary'],
 ['dan', 'birthday', 'oct']]
Word2vec takes several important parameters, including:
min_count: Words whose total frequency in the corpus is lower than this value are ignored.
workers: The number of worker threads used to train the model.
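The effect of min_count can be sketched without gensim. The snippet below is a stand-alone illustration, not gensim's implementation: filter_by_min_count is a hypothetical helper and the sentences are made up. It drops every word whose total frequency across the corpus is below the threshold.

```python
from collections import Counter

def filter_by_min_count(sentences, min_count=2):
    """Drop every word whose total corpus frequency is below min_count."""
    freq = Counter(word for tokens in sentences for word in tokens)
    vocab = {word for word, n in freq.items() if n >= min_count}
    filtered = [[w for w in tokens if w in vocab] for tokens in sentences]
    return filtered, vocab

# Hypothetical tokenised reviews
sentences = [["cannot", "open", "app"], ["open", "app", "daily"], ["app", "crashes"]]
filtered, vocab = filter_by_min_count(sentences, min_count=2)
print(sorted(vocab))
print(filtered)
```

A higher min_count shrinks the vocabulary and removes rare words and typos, at the cost of having no vector for them.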
!pip install --user gensim==3.8.3
Collecting gensim==3.8.3
  Using cached gensim-3.8.3.tar.gz (23.4 MB)
  Preparing metadata (setup.py): started
  Preparing metadata (setup.py): finished with status 'done'
Requirement already satisfied: numpy>=1.11.3 in c:\users\sanjay kumar cj\appdata\local\programs\python\python310\lib\site-packages (from gensim==3.8.3) (1.21.6)
Requirement already satisfied: scipy>=0.18.1 in c:\users\sanjay kumar cj\appdata\local\programs\python\python310\lib\site-packages (from gensim==3.8.3) (1.8.1)
Requirement already satisfied: six>=1.5.0 in c:\users\sanjay kumar cj\appdata\roaming\python\python310\site-packages (from gensim==3.8.3) (1.16.0)
Requirement already satisfied: smart_open>=1.8.1 in c:\users\sanjay kumar cj\appdata\local\programs\python\python310\lib\site-packages (from gensim==3.8.3) (6.3.0)
Building wheels for collected packages: gensim
  Building wheel for gensim (setup.py): started
  Building wheel for gensim (setup.py): finished with status 'error'
  Running setup.py clean for gensim
Failed to build gensim
Installing collected packages: gensim
  Running setup.py install for gensim: started
  Running setup.py install for gensim: finished with status 'error'
error: subprocess-exited-with-error
python setup.py bdist_wheel did not run successfully.
exit code: 1
[454 lines of output]
gensim\test\test_data\para2para_text1.txt -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\para2para_text2.txt -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\phraser-3.6.0.model -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\phraser-no-common-terms.pkl -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\phraser-no-scoring.pkl -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\phraser-scoring-str.pkl -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\phrases-3.6.0.model -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\phrases-no-common-terms.pkl -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\phrases-no-scoring.pkl -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\phrases-scoring-str.pkl -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\phrases-transformer-new-v3-5-0.pkl -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\phrases-transformer-v3-5-0.pkl -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\poincare_cp852.tsv -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\poincare_hypernyms.tsv -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\poincare_hypernyms_large.tsv -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\poincare_test_3.4.0 -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\poincare_utf8.tsv -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\poincare_vectors.bin -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\pre_0_13_2_model -> build\lib.win-amd64-3.10\gensim\test\test_data copying 
gensim\test\test_data\pre_0_13_2_model.state -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\pretrained.vec -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\questions-words.txt -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\reproduce.dat -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\reproduce.dat.gz -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\similarities0-1.txt -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\simlex999.txt -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\small_tag_doc_5_iter50 -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\test_corpus_ok.mm -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\test_corpus_small.mm -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\test_glove.txt -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\test_mmcorpus_corrupt.mm -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\test_mmcorpus_no_index.mm -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\test_mmcorpus_no_index.mm.bz2 -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\test_mmcorpus_no_index.mm.gz -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\test_mmcorpus_overflow.mm -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\test_mmcorpus_with_index.mm -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\test_mmcorpus_with_index.mm.index -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testcorpus.blei -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testcorpus.blei.index -> 
build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testcorpus.blei.vocab -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testcorpus.low -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testcorpus.low.index -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testcorpus.mallet -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testcorpus.mallet.index -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testcorpus.mm -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testcorpus.mm.index -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testcorpus.svmlight -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testcorpus.svmlight.index -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testcorpus.txt -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testcorpus.uci -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testcorpus.uci.index -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testcorpus.uci.vocab -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testcorpus.xml.bz2 -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testcorpus_serialization.mm.index -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testlowdistinctwords.txt -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testrepeatedkeywords.txt -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\testsummarization_unrelated.txt -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\tfidf_model.tst -> build\lib.win-amd64-3.10\gensim\test\test_data copying 
gensim\test\test_data\tfidf_model.tst.bz2 -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\tfidf_model_3_2.tst -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\toy-data.txt -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\toy-model-pretrained.bin -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\toy-model.bin -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\toy-model.vec -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\varembed_lee_subcorpus.cor -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\varembed_morfessor.bin -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\varembed_vectors.pkl -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\w2v-lee-v0.12.0 -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\w2v_keyedvectors_load_test.modeldata -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\w2v_keyedvectors_load_test.vocab -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_3.3 -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_old -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_old_sep -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_old_sep.syn0_lockf.npy -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_old_sep.syn1neg.npy -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_c -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_py2 -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_py3 -> 
build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_py3_4 -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_sep_py2 -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_sep_py2.neg_labels.npy -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_sep_py2.syn0.npy -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_sep_py2.syn0_lockf.npy -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_sep_py2.syn1neg.npy -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_sep_py3 -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_sep_py3.neg_labels.npy -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_sep_py3.syn0.npy -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_sep_py3.syn0_lockf.npy -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_sep_py3.syn1neg.npy -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_sep_py3_4 -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_sep_py3_4.neg_labels.npy -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_sep_py3_4.syn0.npy -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_sep_py3_4.syn0_lockf.npy -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\word2vec_pre_kv_sep_py3_4.syn1neg.npy -> build\lib.win-amd64-3.10\gensim\test\test_data copying gensim\test\test_data\wordsim353.tsv -> build\lib.win-amd64-3.10\gensim\test\test_data creating 
build\lib.win-amd64-3.10\gensim\test\test_data\DTM copying gensim\test\test_data\DTM\ldaseq_3_0_1_model -> build\lib.win-amd64-3.10\gensim\test\test_data\DTM copying gensim\test\test_data\DTM\sstats_test.txt -> build\lib.win-amd64-3.10\gensim\test\test_data\DTM creating build\lib.win-amd64-3.10\gensim\test\test_data\PathLineSentences copying gensim\test\test_data\PathLineSentences\1.txt -> build\lib.win-amd64-3.10\gensim\test\test_data\PathLineSentences copying gensim\test\test_data\PathLineSentences\2.txt.bz2 -> build\lib.win-amd64-3.10\gensim\test\test_data\PathLineSentences creating build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_0.12.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_0.12.1.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_0.12.2.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_0.12.3.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_0.12.4.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_0.13.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_0.13.1.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_0.13.2.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_0.13.3.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_0.13.4.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_1.0.0.mdl -> 
build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_1.0.1.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_2.0.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_2.1.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_2.2.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_2.3.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_3.0.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_3.1.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_3.2.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_3.3.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models copying gensim\test\test_data\old_d2v_models\d2v_3.4.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_d2v_models creating build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_0.12.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_0.12.1.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_0.12.2.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_0.12.3.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_0.12.4.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models 
copying gensim\test\test_data\old_w2v_models\w2v_0.13.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_0.13.1.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_0.13.2.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_0.13.3.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_0.13.4.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_1.0.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_1.0.1.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_2.0.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_2.1.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_2.2.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_2.3.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_3.0.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_3.1.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_3.2.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_3.3.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models copying gensim\test\test_data\old_w2v_models\w2v_3.4.0.mdl -> build\lib.win-amd64-3.10\gensim\test\test_data\old_w2v_models running 
build_ext building 'gensim.models.word2vec_inner' extension error: Microsoft Visual C++ 14.0 or greater is required. Get it with "Microsoft C++ Build Tools": https://visualstudio.microsoft.com/visual-cpp-build-tools/ [end of output] note: This error originates from a subprocess, and is likely not a problem with pip. ERROR: Failed building wheel for gensim error: subprocess-exited-with-error Running setup.py install for gensim did not run successfully. exit code: 1 [454 lines of output] running install running build running build_py creating build creating build\lib.win-amd64-3.10 creating build\lib.win-amd64-3.10\gensim copying gensim\downloader.py -> build\lib.win-amd64-3.10\gensim copying gensim\interfaces.py -> build\lib.win-amd64-3.10\gensim copying gensim\matutils.py -> build\lib.win-amd64-3.10\gensim copying gensim\nosy.py -> build\lib.win-amd64-3.10\gensim copying gensim\utils.py -> build\lib.win-amd64-3.10\gensim copying gensim\__init__.py -> build\lib.win-amd64-3.10\gensim creating build\lib.win-amd64-3.10\gensim\corpora copying gensim\corpora\bleicorpus.py -> build\lib.win-amd64-3.10\gensim\corpora copying gensim\corpora\csvcorpus.py -> build\lib.win-amd64-3.10\gensim\corpora copying gensim\corpora\dictionary.py -> build\lib.win-amd64-3.10\gensim\corpora copying gensim\corpora\hashdictionary.py -> build\lib.win-amd64-3.10\gensim\corpora copying gensim\corpora\indexedcorpus.py -> build\lib.win-amd64-3.10\gensim\corpora copying gensim\corpora\lowcorpus.py -> build\lib.win-amd64-3.10\gensim\corpora copying gensim\corpora\malletcorpus.py -> build\lib.win-amd64-3.10\gensim\corpora copying gensim\corpora\mmcorpus.py -> build\lib.win-amd64-3.10\gensim\corpora copying gensim\corpora\sharded_corpus.py -> build\lib.win-amd64-3.10\gensim\corpora copying gensim\corpora\svmlightcorpus.py -> build\lib.win-amd64-3.10\gensim\corpora copying gensim\corpora\textcorpus.py -> build\lib.win-amd64-3.10\gensim\corpora copying gensim\corpora\ucicorpus.py -> 
build\lib.win-amd64-3.10\gensim\corpora copying gensim\corpora\wikicorpus.py -> build\lib.win-amd64-3.10\gensim\corpora copying gensim\corpora\__init__.py -> build\lib.win-amd64-3.10\gensim\corpora creating build\lib.win-amd64-3.10\gensim\models copying gensim\models\atmodel.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\basemodel.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\base_any2vec.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\callbacks.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\coherencemodel.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\doc2vec.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\fasttext.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\hdpmodel.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\keyedvectors.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\ldamodel.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\ldamulticore.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\ldaseqmodel.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\lda_dispatcher.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\lda_worker.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\logentropy_model.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\lsimodel.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\lsi_dispatcher.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\lsi_worker.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\nmf.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\normmodel.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\phrases.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\poincare.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\rpmodel.py -> 
build\lib.win-amd64-3.10\gensim\models copying gensim\models\tfidfmodel.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\translation_matrix.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\utils_any2vec.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\word2vec.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\_fasttext_bin.py -> build\lib.win-amd64-3.10\gensim\models copying gensim\models\__init__.py -> build\lib.win-amd64-3.10\gensim\models creating build\lib.win-amd64-3.10\gensim\parsing copying gensim\parsing\porter.py -> build\lib.win-amd64-3.10\gensim\parsing copying gensim\parsing\preprocessing.py -> build\lib.win-amd64-3.10\gensim\parsing copying gensim\parsing\__init__.py -> build\lib.win-amd64-3.10\gensim\parsing creating build\lib.win-amd64-3.10\gensim\scripts copying gensim\scripts\glove2word2vec.py -> build\lib.win-amd64-3.10\gensim\scripts copying gensim\scripts\make_wiki.py -> build\lib.win-amd64-3.10\gensim\scripts copying gensim\scripts\make_wikicorpus.py -> build\lib.win-amd64-3.10\gensim\scripts copying gensim\scripts\make_wiki_lemma.py -> build\lib.win-amd64-3.10\gensim\scripts copying gensim\scripts\make_wiki_online.py -> build\lib.win-amd64-3.10\gensim\scripts copying gensim\scripts\make_wiki_online_lemma.py -> build\lib.win-amd64-3.10\gensim\scripts copying gensim\scripts\make_wiki_online_nodebug.py -> build\lib.win-amd64-3.10\gensim\scripts copying gensim\scripts\package_info.py -> build\lib.win-amd64-3.10\gensim\scripts copying gensim\scripts\segment_wiki.py -> build\lib.win-amd64-3.10\gensim\scripts copying gensim\scripts\word2vec2tensor.py -> build\lib.win-amd64-3.10\gensim\scripts copying gensim\scripts\word2vec_standalone.py -> build\lib.win-amd64-3.10\gensim\scripts copying gensim\scripts\__init__.py -> build\lib.win-amd64-3.10\gensim\scripts creating build\lib.win-amd64-3.10\gensim\similarities copying gensim\similarities\docsim.py -> 
build\lib.win-amd64-3.10\gensim\similarities copying gensim\similarities\index.py -> build\lib.win-amd64-3.10\gensim\similarities copying gensim\similarities\levenshtein.py -> build\lib.win-amd64-3.10\gensim\similarities copying gensim\similarities\nmslib.py -> build\lib.win-amd64-3.10\gensim\similarities copying gensim\similarities\termsim.py -> build\lib.win-amd64-3.10\gensim\similarities copying gensim\similarities\__init__.py -> build\lib.win-amd64-3.10\gensim\similarities creating build\lib.win-amd64-3.10\gensim\sklearn_api copying gensim\sklearn_api\atmodel.py -> build\lib.win-amd64-3.10\gensim\sklearn_api copying gensim\sklearn_api\d2vmodel.py -> build\lib.win-amd64-3.10\gensim\sklearn_api copying gensim\sklearn_api\ftmodel.py -> build\lib.win-amd64-3.10\gensim\sklearn_api copying gensim\sklearn_api\hdp.py -> build\lib.win-amd64-3.10\gensim\sklearn_api copying gensim\sklearn_api\ldamodel.py -> build\lib.win-amd64-3.10\gensim\sklearn_api copying gensim\sklearn_api\ldaseqmodel.py -> build\lib.win-amd64-3.10\gensim\sklearn_api copying gensim\sklearn_api\lsimodel.py -> build\lib.win-amd64-3.10\gensim\sklearn_api copying gensim\sklearn_api\phrases.py -> build\lib.win-amd64-3.10\gensim\sklearn_api copying gensim\sklearn_api\rpmodel.py -> build\lib.win-amd64-3.10\gensim\sklearn_api copying gensim\sklearn_api\text2bow.py -> build\lib.win-amd64-3.10\gensim\sklearn_api copying gensim\sklearn_api\tfidf.py -> build\lib.win-amd64-3.10\gensim\sklearn_api copying gensim\sklearn_api\w2vmodel.py -> build\lib.win-amd64-3.10\gensim\sklearn_api copying gensim\sklearn_api\__init__.py -> build\lib.win-amd64-3.10\gensim\sklearn_api creating build\lib.win-amd64-3.10\gensim\summarization copying gensim\summarization\bm25.py -> build\lib.win-amd64-3.10\gensim\summarization copying gensim\summarization\commons.py -> build\lib.win-amd64-3.10\gensim\summarization copying gensim\summarization\graph.py -> build\lib.win-amd64-3.10\gensim\summarization copying 
gensim\summarization\keywords.py -> build\lib.win-amd64-3.10\gensim\summarization copying gensim\summarization\mz_entropy.py -> build\lib.win-amd64-3.10\gensim\summarization copying gensim\summarization\pagerank_weighted.py -> build\lib.win-amd64-3.10\gensim\summarization copying gensim\summarization\summarizer.py -> build\lib.win-amd64-3.10\gensim\summarization copying gensim\summarization\syntactic_unit.py -> build\lib.win-amd64-3.10\gensim\summarization copying gensim\summarization\textcleaner.py -> build\lib.win-amd64-3.10\gensim\summarization copying gensim\summarization\__init__.py -> build\lib.win-amd64-3.10\gensim\summarization creating build\lib.win-amd64-3.10\gensim\test copying gensim\test\basetmtests.py -> build\lib.win-amd64-3.10\gensim\test copying gensim\test\simspeed.py -> build\lib.win-amd64-3.10\gensim\test copying gensim\test\simspeed2.py -> build\lib.win-amd64-3.10\gensim\test copying gensim\test\svd_error.py -> build\lib.win-amd64-3.10\gensim\test copying gensim\test\test_aggregation.py -> build\lib.win-amd64-3.10\gensim\test copying gensim\test\test_api.py -> build\lib.win-amd64-3.10\gensim\test copying gensim\test\test_atmodel.py -> build\lib.win-amd64-3.10\gensim\test copying gensim\test\test_big.py -> build\lib.win-amd64-3.10\gensim\test copying gensim\test\test_BM25.py -> build\lib.win-amd64-3.10\gensim\test copying gensim\test\test_coherencemodel.py -> build\lib.win-amd64-3.10\gensim\test copying gensim\test\test_corpora.py -> build\lib.win-amd64-3.10\gensim\test copying gensim\test\test_corpora_dictionary.py -> build\lib.win-amd64-3.10\gensim\test copying gensim\test\test_corpora_hashdictionary.py -> build\lib.win-amd64-3.10\gensim\test copying gensim\test\test_d2vmodel.py -> build\lib.win-amd64-3.10\gensim\test copying gensim\test\test_datatype.py -> build\lib.win-amd64-3.10\gensim\test copying gensim\test\test_direct_confirmation.py -> build\lib.win-amd64-3.10\gensim\test copying gensim\test\test_doc2vec.py -> 
running build_ext
building 'gensim.models.word2vec_inner' extension
error: Microsoft Visual C++ 14.0 or greater is required. Get it with "Microsoft C++ Build Tools": https://visualstudio.microsoft.com/visual-cpp-build-tools/
[end of output]
note: This error originates from a subprocess, and is likely not a problem with pip.
error: legacy-install-failure
Encountered error while trying to install package. gensim
note: This is an issue with the package mentioned above, not pip.
import gensim
print(gensim.__version__)
4.3.0
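# Model creation: train Word2Vec on the tokenized reviews.
# min_count = 1 keeps every word, even one that occurs only once; workers = 4 trains on four threads.
model = Word2Vec(words_list, min_count = 1, workers = 4)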
model
<gensim.models.word2vec.Word2Vec at 0x23e27586610>
# saving the model
model.save("word2vec.model")
Let’s have a quick look at the vocabulary words:
words = model.wv.key_to_index
len(words)
10766
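Observations: the vocabulary contains 10,766 unique words. Because min_count = 1, no word was dropped for being rare.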
Let’s check out a random word embedding using the trained model:
word = "problem"
model.wv[word]
array([-0.27052438, 0.66047114, 0.43230653, 0.17215721, 0.02286145, -0.7281829 , -0.30398017, 1.2732266 , -0.42268634, -0.49938542, -0.4137816 , -1.0662372 , -0.02833825, 0.36425388, 0.5513033 , -0.4578458 , -0.20516293, -0.5881861 , -0.22395165, -1.2217709 , 0.6628571 , 0.5715375 , 0.19484676, 0.11814828, -0.6140165 , 0.16245942, -0.49345425, -0.41365436, -0.39757445, -0.10512283, 0.3785423 , 0.14333546, -0.03230482, -0.01653199, 0.14334445, 0.7196108 , 0.18091239, -0.34358716, -0.48699692, -1.3056369 , -0.2313133 , -0.07931656, -0.11053191, 0.11854964, 0.41814506, -0.00526063, -0.68860596, -0.12569857, 0.47122386, 0.3085363 , -0.12029681, -0.3997099 , -0.08677533, 0.00372786, -0.5070272 , 0.58567125, 0.2485312 , 0.02351037, -0.732655 , 0.11326014, 0.5195224 , 0.20478901, 0.02056827, 0.19526173, -1.0651096 , 0.2629253 , 0.4433038 , 0.55956984, -0.5095072 , 0.6356722 , 0.02562599, 0.31776133, 0.603559 , -0.47490615, 0.55050033, 0.81685334, 0.02502957, 0.24660029, -0.15513995, -0.12330483, -0.2676922 , -0.04856463, -0.54694176, 0.7951504 , -0.10051891, 0.31335434, 0.01176996, 0.48085523, 0.47729316, -0.24665968, 0.9952517 , 0.14649296, 0.11186937, 0.35738602, 1.0985878 , 0.72549397, -0.22098272, -1.1246119 , 0.1612393 , 0.0695435 ], dtype=float32)
# Let's see the vocab size
print("Length of the vocab is = ", len(list(model.wv.key_to_index)))
Length of the vocab is = 10766
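The vocabulary size depends directly on the min_count threshold passed to Word2Vec, which ignores words whose total frequency is below it. The following is a minimal pure-Python sketch of that pruning idea, not gensim's actual implementation; the helper name build_vocabulary and the toy corpus are hypothetical and introduced only for illustration.

```python
from collections import Counter

def build_vocabulary(tokenized_docs, min_count=1):
    """Return the words that occur at least `min_count` times across all documents."""
    counts = Counter(token for doc in tokenized_docs for token in doc)
    # Most frequent words first; ties are broken alphabetically for determinism
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [word for word, count in ranked if count >= min_count]

# Hypothetical toy corpus (not the app-review data)
toy_docs = [
    ["great", "app", "love", "it"],
    ["app", "crashes", "often"],
    ["love", "the", "app"],
]

vocab_all = build_vocabulary(toy_docs, min_count=1)     # keeps every word
vocab_pruned = build_vocabulary(toy_docs, min_count=2)  # drops words seen only once
print(len(vocab_all), vocab_all)
print(len(vocab_pruned), vocab_pruned)
```

With min_count = 1, as used for the model above, rare words and typos all receive an embedding; a higher threshold would shrink the vocabulary at the cost of dropping those words.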
# Top 10 similar words to the word 'happy'
similar = model.wv.similar_by_word('happy')
print(similar)
[('functionality', 0.9992568492889404), ('enough', 0.9991658926010132), ('bad', 0.9991382360458374), ('almost', 0.9987328052520752), ('impressed', 0.9985877275466919), ('download', 0.9985634684562683), ('ok', 0.9984879493713379), ('certainly', 0.9984339475631714), ('disappointed', 0.9982004165649414), ('recommended', 0.9979566335678101)]
# Top 10 similar words to the word 'premium'
similar = model.wv.similar_by_word('premium')
print(similar)
[('paid', 0.9978939294815063), ('free', 0.9884848594665527), ('pro', 0.9878509640693665), ('buy', 0.9847126603126526), ('upgraded', 0.9798856973648071), ('bought', 0.9749058485031128), ('pay', 0.9737546443939209), ('payed', 0.9733726978302002), ('full', 0.9695376753807068), ('version', 0.9678621292114258)]
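The scores returned by similar_by_word are cosine similarities between word vectors. The sketch below shows how such a ranking can be computed by hand; the function names and the 3-dimensional toy vectors are hypothetical, invented for illustration, and are not taken from the trained model.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def most_similar(query, vectors, topn=10):
    """Rank every other word by cosine similarity to `query`, highest first."""
    scores = [
        (word, cosine_similarity(vectors[query], vec))
        for word, vec in vectors.items()
        if word != query
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)[:topn]

# Hypothetical 3-dimensional vectors, invented for illustration only
toy_vectors = {
    "premium": [1.0, 0.0, 0.0],
    "paid": [2.0, 0.0, 0.0],
    "free": [1.0, 1.0, 0.0],
    "crash": [0.0, 0.0, 1.0],
}

ranking = most_similar("premium", toy_vectors, topn=3)
print(ranking)
```

Cosine similarity compares direction only, not length, so "paid" scores a perfect match here even though its vector is twice as long. When all scores are very close to 1, as in the output for 'happy', the vectors point in nearly the same direction and the ranking separates words only weakly.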
# Collect the vector of every vocabulary word into one matrix (one row per word)
words = list(model.wv.key_to_index)
wvs = model.wv[words]
def average_word_vectors(words, model, vocabulary, num_features):
    """Return the mean embedding of the in-vocabulary words of one document."""
    feature_vector = np.zeros((num_features,), dtype="float64")
    nwords = 0.
    for word in words:
        if word in vocabulary:
            nwords = nwords + 1.
            feature_vector = np.add(feature_vector, model.wv[word])
    # Avoid dividing by zero when no word of the document is in the vocabulary
    if nwords:
        feature_vector = np.divide(feature_vector, nwords)
    return feature_vector

def averaged_word_vectorizer(corpus, model, num_features):
    """Return one averaged embedding per tokenized document in the corpus."""
    vocabulary = set(model.wv.key_to_index)
    features = [average_word_vectors(tokenized_sentence, model, vocabulary, num_features)
                for tokenized_sentence in corpus]
    return np.array(features)
feature_size = 100
# get document level embeddings
w2v_feature_array = averaged_word_vectorizer(corpus = words_list, model = model,
                                             num_features = feature_size)
pd.DataFrame(w2v_feature_array)
index | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | … | 90 | 91 | 92 | 93 | 94 | 95 | 96 | 97 | 98 | 99 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | -0.229789 | 0.593847 | 0.412772 | 0.223934 | -0.019704 | -0.751078 | -0.234899 | 1.150044 | -0.460823 | -0.524459 | … | 0.861504 | 0.160020 | 0.100419 | 0.342614 | 1.032167 | 0.663403 | -0.274530 | -1.021782 | 0.147821 | 0.061832 |
1 | -0.097555 | 0.277454 | 0.193175 | 0.108737 | -0.040563 | -0.394737 | -0.075167 | 0.544438 | -0.253749 | -0.291141 | … | 0.429767 | 0.068847 | 0.022167 | 0.179019 | 0.523606 | 0.312639 | -0.165798 | -0.482241 | 0.062892 | 0.058780 |
2 | -0.170952 | 0.497397 | 0.346304 | 0.139143 | -0.042142 | -0.541193 | 0.005205 | 0.894036 | -0.319767 | -0.611304 | … | 0.863967 | -0.012429 | -0.016678 | 0.379411 | 0.874443 | 0.466733 | -0.289189 | -0.654143 | 0.032389 | 0.240297 |
3 | -0.143223 | 0.306412 | 0.221153 | 0.135441 | -0.028698 | -0.497967 | -0.089266 | 0.724956 | -0.351435 | -0.343889 | … | 0.511447 | 0.118109 | 0.004788 | 0.163300 | 0.672340 | 0.386130 | -0.147217 | -0.569982 | 0.087615 | -0.001157 |
4 | -0.038865 | 0.089341 | 0.073515 | 0.030779 | 0.003670 | -0.106232 | -0.042441 | 0.179975 | -0.071766 | -0.078861 | … | 0.141753 | 0.026758 | 0.008873 | 0.048093 | 0.160513 | 0.097789 | -0.027041 | -0.158747 | 0.025139 | -0.004630 |
… | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … | … |
12490 | -0.130908 | 0.274303 | 0.216486 | 0.135022 | -0.037544 | -0.502736 | -0.116020 | 0.715697 | -0.376579 | -0.339709 | … | 0.458096 | 0.167547 | -0.015535 | 0.110368 | 0.655325 | 0.370587 | -0.122764 | -0.572411 | 0.108028 | -0.061343 |
12491 | 0.008760 | -0.002272 | -0.006867 | 0.007277 | 0.000761 | 0.008345 | -0.002872 | -0.008380 | -0.004519 | 0.004462 | … | 0.008209 | 0.000245 | -0.000154 | 0.008418 | 0.009567 | -0.004852 | -0.000436 | -0.004150 | 0.001532 | -0.001190 |
12492 | -0.223108 | 0.534245 | 0.382745 | 0.179026 | -0.004710 | -0.765861 | -0.166396 | 1.160753 | -0.476742 | -0.601194 | … | 0.890069 | 0.144968 | -0.002535 | 0.325299 | 1.063407 | 0.636505 | -0.267623 | -0.900975 | 0.096009 | 0.035216 |
12493 | -0.167480 | 0.460005 | 0.363941 | 0.164009 | -0.042362 | -0.595785 | -0.132989 | 0.991289 | -0.394554 | -0.539594 | … | 0.813269 | 0.119084 | -0.027819 | 0.275314 | 0.924026 | 0.522199 | -0.231620 | -0.788232 | 0.111122 | 0.038227 |
12494 | -0.097190 | 0.198784 | 0.152250 | 0.136117 | -0.034669 | -0.499222 | 0.067609 | 0.684397 | -0.318572 | -0.458987 | … | 0.471958 | 0.049555 | -0.010051 | 0.109419 | 0.674455 | 0.329052 | -0.135996 | -0.366565 | -0.007977 | 0.077911 |
12495 rows × 100 columns
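Each row above is the mean of the Word2vec vectors of the tokens in one review, with out-of-vocabulary tokens skipped. A toy sketch of that averaging follows, using a made-up two-dimensional embedding table instead of the trained model; `toy_vectors` and `average_vector` are hypothetical names, and plain lists are used so the example runs without the notebook's dependencies:

```python
# Hypothetical 2-dimensional embeddings, invented for this example
toy_vectors = {
    "good": [1.0, 0.0],
    "app": [0.0, 1.0],
    "bad": [-1.0, 0.0],
}

def average_vector(tokens, vectors, num_features):
    """Mean of the vectors of known tokens; all zeros if none are known."""
    total = [0.0] * num_features
    n_known = 0
    for token in tokens:
        if token in vectors:
            n_known += 1
            total = [t + x for t, x in zip(total, vectors[token])]
    if n_known == 0:
        return total
    return [t / n_known for t in total]

print(average_vector(["good", "app"], toy_vectors, 2))      # [0.5, 0.5]
print(average_vector(["good", "unknown"], toy_vectors, 2))  # [1.0, 0.0]
print(average_vector(["unknown"], toy_vectors, 2))          # [0.0, 0.0]
```

Two consequences are worth keeping in mind: a review with no in-vocabulary tokens maps to the zero vector, and the mean discards word order.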
ln = LabelEncoder()
y = ln.fit_transform(data['label'])
y
array([2, 2, 2, ..., 0, 0, 0])
from sklearn.model_selection import train_test_split
xtrain,xtest,ytrain,ytest = train_test_split(w2v_feature_array,y, test_size=0.2, random_state=42)
# initializing the Random Forest model
model = RandomForestClassifier(random_state = 7)
# fitting the model on training set
model.fit(xtrain,ytrain)
RandomForestClassifier(random_state=7)
# making predictions on the test set
y_pred = model.predict(xtest)
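# checking the metrics
# classification_report expects (y_true, y_pred). The report shown below was produced
# with the two arguments reversed, so its precision and recall columns are swapped and
# its support column counts predicted labels rather than true labels.
print(classification_report(ytest, y_pred))
              precision    recall  f1-score   support

           0       0.78      0.69      0.74      1269
           1       0.02      0.19      0.04        53
           2       0.75      0.61      0.68      1177

    accuracy                           0.64      2499
   macro avg       0.52      0.50      0.48      2499
weighted avg       0.75      0.64      0.69      2499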
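scikit-learn's `classification_report` takes the true labels as its first argument and the predictions as its second. The dependency-free sketch below computes per-class precision and recall by hand and shows that reversing the two arguments swaps the two metrics; the helper `precision_recall` and the toy labels are illustrative assumptions, not part of the notebook:

```python
def precision_recall(y_true, y_pred, label):
    """Precision and recall for one class, computed from raw label lists."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
    predicted = sum(1 for p in y_pred if p == label)  # times the class was predicted
    actual = sum(1 for t in y_true if t == label)     # times the class truly occurs
    precision = tp / predicted if predicted else 0.0
    recall = tp / actual if actual else 0.0
    return precision, recall

# Toy labels: class 1 occurs four times but is predicted only once
y_true = [0, 0, 1, 1, 1, 1, 2, 2]
y_pred = [0, 0, 1, 0, 2, 2, 2, 2]

print(precision_recall(y_true, y_pred, 1))  # (1.0, 0.25)
print(precision_recall(y_pred, y_true, 1))  # (0.25, 1.0), arguments reversed
```

The f1-score and the overall accuracy are symmetric in the two arguments, so they are unaffected by the order.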
Global Vectors for Word Representation (GloVe) is an alternative to Word2vec. Like Word2vec, it is an unsupervised learning algorithm: given a corpus of text, it maps each word to a position in a high-dimensional latent space, and that position is the vector representation of the word. Here we do not train GloVe ourselves; we load pretrained 300-dimensional vectors from the file glove.6B.300d.txt.
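GloVe is trained on global word-to-word co-occurrence counts gathered over the whole corpus. As a rough sketch of how such counts can be collected, the example below counts ordered word pairs inside a fixed window. The toy corpus, the window size of 1, and the helper name `cooccurrence_counts` are assumptions for illustration; the actual GloVe pipeline additionally down-weights distant pairs and then fits vectors to the logarithm of the counts.

```python
from collections import Counter

def cooccurrence_counts(sentences, window=1):
    """Count how often each ordered word pair appears within `window` tokens."""
    counts = Counter()
    for tokens in sentences:
        for i, word in enumerate(tokens):
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    counts[(word, tokens[j])] += 1
    return counts

# Toy tokenized corpus (invented for this example)
corpus = [
    ["premium", "version", "costly"],
    ["free", "version", "useful"],
]
counts = cooccurrence_counts(corpus)
print(counts[("version", "premium")])  # 1
print(counts[("version", "free")])     # 1
print(counts[("premium", "costly")])   # 0, the two words are 2 tokens apart
```

Word2vec never materialises this table; it streams over the same windows and updates the vectors one (target, context) pair at a time.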
# load the GloVe vectors into a dictionary: word -> 300-dimensional vector
embeddings_index = {}
with open(r'glove.6B.300d.txt', encoding="utf8") as f:
    for line in tqdm(f):
        # split each line into the word and its coefficients
        values = line.split()
        word = values[0]
        coefs = np.array(values[1:], dtype='float32')
        embeddings_index[word] = coefs
print('Found %s word vectors.' % len(embeddings_index))
400000it [00:34, 11594.24it/s]
Found 400000 word vectors.
# This function creates an L2-normalized vector for the whole sentence
def sent2vec(s):
    words = word_tokenize(s)
    # keep only the tokens that have a GloVe vector
    N = [embeddings_index[w] for w in words if w in embeddings_index]
    if not N:
        # no known tokens: fall back to a zero vector
        return np.zeros(300)
    # sum the word vectors and scale the result to unit length
    v = np.sum(N, axis=0)
    return v / np.sqrt((v ** 2).sum())
# Let's have a quick look at the independent feature
data.final_text.values
array(['cannot open app anymore', 'begging refund app month nobody replying', 'costly premium version approx indian rupees per year better download premium version app apkmos website use microsoft list app far better', ..., 'useful apps must try', 'would pay even additions updates absolutely love wish would develop even', 'sooow good'], dtype=object)
y = ln.fit_transform(data['label'])
y
array([2, 2, 2, ..., 0, 0, 0])
xtrain, xvalid, ytrain, yvalid = train_test_split(data.final_text.values, y,
                                                  stratify = y,
                                                  random_state = 42,
                                                  test_size = 0.2, shuffle = True)
ytrain
array([0, 0, 0, ..., 2, 2, 0])
print("Shape of Training set : ", xtrain.shape)
print("Shape of test set : ", xvalid.shape)
Shape of Training set : (9996,)
Shape of test set : (2499,)
# Create sentence vectors using the above function for training and validation set
xtrain_glove = [sent2vec(x) for x in tqdm(xtrain)]
xvalid_glove = [sent2vec(x) for x in tqdm(xvalid)]
100%|████████████████████████████████████████████████████████████████████████████| 9996/9996 [00:01<00:00, 6150.50it/s]
100%|████████████████████████████████████████████████████████████████████████████| 2499/2499 [00:00<00:00, 5308.50it/s]
# initializing the Random Forest model
model = RandomForestClassifier(random_state = 7)
# fitting the model on training set
model.fit(xtrain_glove,ytrain)
RandomForestClassifier(random_state=7)
# making predictions on the test set
y_pred = model.predict(xvalid_glove)
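# checking the metrics
# classification_report expects (y_true, y_pred). The report shown below was produced
# with the two arguments reversed, so its precision and recall columns are swapped and
# its support column counts predicted labels rather than true labels.
print(classification_report(yvalid, y_pred))
              precision    recall  f1-score   support

           0       0.81      0.73      0.76      1257
           1       0.01      0.44      0.02         9
           2       0.80      0.63      0.70      1233

    accuracy                           0.68      2499
   macro avg       0.54      0.60      0.50      2499
weighted avg       0.80      0.68      0.73      2499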
Conclusions
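The two models are trained in different ways, and hence they output different word vectors.
The GloVe model is based on global word-to-word co-occurrence counts computed over the whole corpus, whereas Word2vec uses the co-occurrence of words within a local context window (neighboring words).
GloVe learns its embeddings by constructing the overall co-occurrence matrix and fitting vectors to it. Word2vec, on the other hand, learns by prediction: in its CBOW variant it takes the context words as input and predicts the target word, and in its skip-gram variant it does the reverse.
Both models represent words as semantically meaningful, dense, real-valued vectors.
This overcomes many of the problems of simple one-hot encodings, which are sparse, grow with the vocabulary size, and treat every pair of words as equally dissimilar.
Embeddings often improve generalisation and performance on NLP problems, especially when little training data is available. In this case study, the Random Forest reached an accuracy of 0.64 on the averaged Word2vec vectors and 0.68 on the pretrained GloVe vectors; with both feature sets the f1-score for class 1 stayed below 0.05, so that class is hardly ever recognised.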
Business Recommendations
Dense embedding models such as Word2vec and GloVe can be very useful in automatic sentiment analysis of the reviews of an app on the Google Play Store, and tracking its progression through time.
Beyond sentiment analysis and other classification tasks, unsupervised learning can be used to group the reviews into clusters. Because the embeddings generated by Word2vec and GloVe capture semantics, these clusters are likely to correspond to recurring themes in the reviews. This is another way of using dense embeddings to extract insights from the text.
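As a rough illustration of the clustering idea, here is a tiny k-means written in plain Python and run on made-up two-dimensional points that stand in for review embeddings. The source does not name a clustering algorithm, so k-means, the helper `kmeans`, and the choice of the first k points as initial centroids are all assumptions made for this sketch:

```python
def kmeans(points, k, n_iter=10):
    """Minimal k-means: first k points as initial centroids, squared Euclidean distance."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(n_iter):
        # assignment step: attach each point to its nearest centroid
        for i, p in enumerate(points):
            dists = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in centroids]
            labels[i] = dists.index(min(dists))
        # update step: move each centroid to the mean of its members
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids

# Toy 2-D "review embeddings" (invented values)
points = [[0.0, 0.1], [5.0, 5.1], [0.2, 0.0], [5.2, 4.9], [0.1, 0.2]]
labels, centroids = kmeans(points, k=2)
print(labels)  # [0, 1, 0, 1, 0]
```

On the real data one would more likely pass the document-level embedding matrix to a library implementation such as scikit-learn's `KMeans`, and then read a sample of reviews from each cluster to name its theme.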