Modules:
interfaces – Core gensim interfacesutils – Various utility functionsmatutils – Math utils_matutils – Cython matutilsdownloader – Downloader API for gensimcorpora.bleicorpus – Corpus in Blei’s LDA-C formatcorpora.csvcorpus – Corpus in CSV formatcorpora.dictionary – Construct word<->id mappingscorpora.hashdictionary – Construct word<->id mappingscorpora.indexedcorpus – Random access to corpus documentscorpora.lowcorpus – Corpus in GibbsLda++ formatcorpora.malletcorpus – Corpus in Mallet formatcorpora.mmcorpus – Corpus in Matrix Market formatcorpora._mmreader – Reader for corpus in the Matrix Market format.corpora.sharded_corpus – Corpus stored in separate filescorpora.svmlightcorpus – Corpus in SVMlight formatcorpora.textcorpus – Tools for building corpora with dictionariescorpora.ucicorpus – Corpus in UCI formatcorpora.wikicorpus – Corpus from a Wikipedia dumpmodels.ldamodel – Latent Dirichlet Allocation
models.ldamulticore – parallelized Latent Dirichlet Allocation
models.nmf – Non-Negative Matrix factorizationmodels.lsimodel – Latent Semantic Indexingmodels.ldaseqmodel – Dynamic Topic Modeling in Pythonmodels.tfidfmodel – TF-IDF modelmodels.rpmodel – Random Projectionsmodels.hdpmodel – Hierarchical Dirichlet Processmodels.logentropy_model – LogEntropy modelmodels.normmodel – Normalization modelmodels.translation_matrix – Translation Matrix model
models.lsi_dispatcher – Dispatcher for distributed LSI
models.lsi_worker – Worker for distributed LSI
models.lda_dispatcher – Dispatcher for distributed LDA
models.lda_worker – Worker for distributed LDA
models.atmodel – Author-topic modelsmodels.word2vec – Word2vec embeddings
models.keyedvectors – Store and query word vectors
models.doc2vec – Doc2vec paragraph embeddings
models.fasttext – FastText model
models._fasttext_bin – Facebook I/Omodels.phrases – Phrase (collocation) detectionmodels.poincare – Train and use Poincare embeddingsmodels.coherencemodel – Topic coherence pipelinemodels.basemodel – Core TM interfacemodels.callbacks – Callbacks for track and viz LDA train process
models.utils_any2vec – Utils for any2vec modelsmodels._utils_any2vec – Cython utils for any2vec modelsmodels.word2vec_inner – Cython routines for training Word2Vec modelsmodels.doc2vec_inner – Cython routines for training Doc2Vec modelsmodels.fasttext_inner – Cython routines for training FastText modelsmodels.wrappers.ldamallet – Latent Dirichlet Allocation via Mallet
models.wrappers.dtmmodel – Dynamic Topic Models (DTM) and Dynamic Influence Models (DIM)
models.wrappers.ldavowpalwabbit – Latent Dirichlet Allocation via Vowpal Wabbit
models.wrappers.wordrank – Word Embeddings from WordRank
models.wrappers.varembed – VarEmbed Word Embeddingsmodels.wrappers.fasttext – Wrapper for FastText implementation from Facebookmodels.deprecated.doc2vec – Deep learning with paragraph2vecmodels.deprecated.fasttext – FastText modelmodels.deprecated.word2vec – Deep learning with word2vecmodels.deprecated.keyedvectors – Store and query word vectorsmodels.deprecated.fasttext_wrapper – Wrapper for Facebook implementation of FastText modelmodels.base_any2vec – Base classes for any2vec modelssimilarities.docsim – Document similarity queries
similarities.termsim – Term similarity queriessimilarities.index – Fast Approximate Nearest Neighbor Similarity with Annoy package
sklearn_api.atmodel – Scikit learn wrapper for Author-topic modelsklearn_api.d2vmodel – Scikit learn wrapper for paragraph2vec modelsklearn_api.hdp – Scikit learn wrapper for Hierarchical Dirichlet Process modelsklearn_api.ldamodel – Scikit learn wrapper for Latent Dirichlet Allocationsklearn_api.ldaseqmodel – Scikit learn wrapper for LdaSeq modelsklearn_api.lsimodel – Scikit learn wrapper for Latent Semantic Indexingsklearn_api.phrases – Scikit learn wrapper for phrase (collocation) detectionsklearn_api.rpmodel – Scikit learn wrapper for Random Projection modelsklearn_api.text2bow – Scikit learn wrapper word<->id mappingsklearn_api.tfidf – Scikit learn wrapper for TF-IDF modelsklearn_api.w2vmodel – Scikit learn wrapper for word2vec modeltest.utils – Common utils
topic_coherence.aggregation – Aggregation moduletopic_coherence.direct_confirmation_measure – Direct confirmation measure moduletopic_coherence.indirect_confirmation_measure – Indirect confirmation measure moduletopic_coherence.probability_estimation – Probability estimation moduletopic_coherence.segmentation – Segmentation moduletopic_coherence.text_analysis – Analyzing the texts of a corpus to accumulate statistical information about word occurrencesscripts.package_info – Information about gensim packagescripts.glove2word2vec – Convert glove format to word2vec
scripts.make_wikicorpus – Convert articles from a Wikipedia dump to vectors.scripts.word2vec_standalone – Train word2vec on text file CORPUSscripts.make_wiki_online – Convert articles from a Wikipedia dumpscripts.make_wiki_online_lemma – Convert articles from a Wikipedia dumpscripts.make_wiki_online_nodebug – Convert articles from a Wikipedia dumpscripts.word2vec2tensor – Convert the word2vec format to Tensorflow 2D tensor
scripts.segment_wiki – Convert wikipedia dump to json-line format
parsing.porter – Porter Stemming Algorithm
parsing.preprocessing – Functions to preprocess raw text
summarization.bm25 – BM25 ranking function
summarization.commons – Common graph functionssummarization.graph – Graphsummarization.keywords – Keywords for TextRank summarization algorithm
summarization.mz_entropy – Keywords for the Montemurro and Zanette entropy algorithmsummarization.pagerank_weighted – Weighted PageRank algorithmsummarization.summarizer – TextRank Summariser
summarization.syntactic_unit – Syntactic Unit classsummarization.textcleaner – Summarization pre-processing
viz.poincare – Visualize Poincare embeddings