Trending Topic Extraction using Topic Models and Biterm Discrimination
dc.creator | Quesada Grosso, Minor Eduardo | |
dc.creator | Casasola Murillo, Edgar | |
dc.creator | Leoni de León, Jorge Antonio | |
dc.date.accessioned | 2019-11-18T16:45:15Z | |
dc.date.available | 2019-11-18T16:45:15Z | |
dc.date.issued | 2017 | |
dc.description.abstract | Mining and exploitation of data in social networks has been the focus of many efforts, but despite the resources and energy invested, still remains a lot for doing given its complexity, which requires the adoption of a multidisciplinary approach . Specifically, on what concerns to this research, the content of the texts published regularly, and at a very rapid pace, at sites of microblogs (eg Twitter.com) can be used to analyze global and local trends. These trends are marked by microblogs emerging topics that are distinguished from others by a sudden and accelerated rate of posts related to the same topic; in other words, by an increment of popularity in relatively short periods, a day or a few hours, for example Wanner et al. . The problem, then, is twofold, first to extract the topics, then to identify which of those topics are trending. A recent solution, known as Bursty Biterm Topic Model (BBTM) is an algorithm for identifying trending topics, with a good level of performance in Twitter, but it requires great amount of computer processing. Hence, this research aims to determine if it is possible to reduce the amount of processing required and getting equally good results. This reduction carry out by a discrimination of co-occurrences of words (biterms) used by BBTM to model trending topics. In contrast to our previous work, in this research, we carry on a more complete and exhaustive set of experiments. | es_ES |
dc.description.procedence | UCR::Vicerrectoría de Docencia::Ingeniería::Facultad de Ingeniería::Escuela de Ciencias de la Computación e Informática | es_ES |
dc.description.procedence | UCR::Vicerrectoría de Docencia::Artes y Letras::Facultad de Letras::Escuela de Filología, Lingüística y Literatura | es_ES |
dc.description.sponsorship | Universidad de Costa Rica/[745-B4-048]UCR/Costa Rica | es_ES |
dc.description.sponsorship | Universidad de Costa Rica/[745-B6-175]UCR/Costa Rica | es_ES |
dc.identifier.citation | http://www2.clei.org/cleiej/paper.php?id=376 | |
dc.identifier.codproyecto | 745-B4-048 | |
dc.identifier.codproyecto | 745-B6-175 | |
dc.identifier.doi | 10.19153/cleiej.20.1.3 | |
dc.identifier.issn | 0717- 5000 | |
dc.identifier.uri | https://hdl.handle.net/10669/79877 | |
dc.language.iso | en_US | es_ES |
dc.rights | acceso abierto | es_ES |
dc.source | Clei Electronic Journal, vol. 20(1), pp.1-16 | es_ES |
dc.subject | Trending topics | es_ES |
dc.subject | Topic models | es_ES |
dc.subject | Short text | es_ES |
dc.subject | NLP | es_ES |
dc.subject | Topic extraction | es_ES |
dc.subject | Natural language processing | es_ES |
dc.title | Trending Topic Extraction using Topic Models and Biterm Discrimination | es_ES |
dc.type | artículo original |
Archivos
Bloque original
1 - 1 de 1
Cargando...
- Nombre:
- Trending Topic Extraction.pdf
- Tamaño:
- 1.04 MB
- Formato:
- Adobe Portable Document Format
- Descripción:
- Artículo principal
Bloque de licencias
1 - 1 de 1
No hay miniatura disponible
- Nombre:
- license.txt
- Tamaño:
- 2.83 KB
- Formato:
- Item-specific license agreed upon to submission
- Descripción: