GithubHelp home page GithubHelp logo

clustering's Introduction

Practico Clustering


Catedra: Text mining, FAMAF 2021

Se realiza un clustering de palabras.

Se obtienen varios conjuntos de cluster, diferencados en el tamaño del entorno que se toma de cada palabra.
Trabajo restante:

  • Comparar los conjuntos de cluster y tomar decisión por un entorno fijo a cada palabra.
  • Añadir embeding :D

Para ver los resultados no hace falta ejecutar todo el notebook. Solo se debe:

  • Ejecutar la celda con imports.
  • Ejecutar las celdas que contienen las funciones load_files y load_clusters
  • Ejecutar las celdas
c20_0_0 = load_clusters(0, 0, '20')
c20_1_1 = load_clusters(1, 1, '20')
c20_2_2 = load_clusters(2, 2, '20')
c20_5_5 = load_clusters(5, 5, '20')
c30_5_5 = load_clusters(5, 5, '30')

y

_, _, key_words_0_0, _, matrix_dicc2d_0_0, new_words_0_0 = load_files(0, 0)
_, _, key_words_1_1, _, matrix_dicc2d_1_1, new_words_1_1 = load_files(1, 1)
_, _, key_words_2_2, _, matrix_dicc2d_2_2, new_words_2_2 = load_files(2, 2)
_, _, key_words_5_5, _, matrix_dicc2d_5_5, new_words_5_5 = load_files(5, 5)

Luego solo resta grafigar y comparar listar de clusters descomentando en las ultimas celdas.

Si desea explorar distintos números de clusters y/o entornos sentase libre de hacerlo.

clustering's People

Contributors

morenocl avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.