“TF-IDF Реализация Python” Ответ

TF-IDF Реализация Python

#Importing required module
import numpy as np
from nltk.tokenize import  word_tokenize 
 
#Example text corpus for our tutorial
text = ['Topic sentences are similar to mini thesis statements.\
        Like a thesis statement, a topic sentence has a specific \
        main point. Whereas the thesis is the main point of the essay',\
        'the topic sentence is the main point of the paragraph.\
        Like the thesis statement, a topic sentence has a unifying function. \
        But a thesis statement or topic sentence alone doesn’t guarantee unity.', \
        'An essay is unified if all the paragraphs relate to the thesis,\
        whereas a paragraph is unified if all the sentences relate to the topic sentence.']
 
#Preprocessing the text data
sentences = []
word_set = []
 
for sent in text:
    x = [i.lower() for  i in word_tokenize(sent) if i.isalpha()]
    sentences.append(x)
    for word in x:
        if word not in word_set:
            word_set.append(word)
 
#Set of vocab 
word_set = set(word_set)
#Total documents in our corpus
total_documents = len(sentences)
 
#Creating an index for each word in our vocab.
index_dict = {} #Dictionary to store index for each word
i = 0
for word in word_set:
    index_dict[word] = i
    i += 1

Rias Dwi Prasasti

Ответы похожие на “TF-IDF Реализация Python”

Привязки Python 2 для RPM необходимы для этого модуля. Если вам нужна поддержка Python 3, используйте вместо этого модуль `dnf` ansible .. Модуль Python 2 Yum необходим для этого модуля. Если вам нужна поддержка Python 3, используйте вместо этого модуль `dnf`.

Вопросы похожие на “TF-IDF Реализация Python”

Как загрузить несколько словарных значений, которые хранятся в файле Python и загрузить в другой файл Python в Python Over Loop

Больше похожих ответов на “TF-IDF Реализация Python” по Python

Смотреть популярные ответы по языку

Смотреть другие языки программирования

Shell/Bash

C++

CSS

HTML

Java

JavaScript

Objective-C

PHP

Python

Sql

Swift

Ruby

TypeScript

Kotlin

Assembly

VBA

Scala

Rust

Dart

Elixir

Clojure

Haskell

Matlab

Erlang

Cobol

Fortran

Scheme

Perl

Groovy

Lua

Julia

Delphi

Abap

Lisp

Prolog

Pascal

ActionScript

Basic

Solidity

PowerShell

GDScript

Excel