Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
Bag of biterms modeling for short texts
by
Phan, Tuan Anh
, Nguyen, Thien Huu
, Bach, Tran
, Ngo, Van Linh
, Khoat, Than
in
Algorithms
/ Context
/ Digital media
/ Dirichlet problem
/ Machine learning
/ Probabilistic models
/ Representations
/ Statistical analysis
/ Statistical models
/ Texts
2020
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
Bag of biterms modeling for short texts
by
Phan, Tuan Anh
, Nguyen, Thien Huu
, Bach, Tran
, Ngo, Van Linh
, Khoat, Than
in
Algorithms
/ Context
/ Digital media
/ Dirichlet problem
/ Machine learning
/ Probabilistic models
/ Representations
/ Statistical analysis
/ Statistical models
/ Texts
2020
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
Journal Article
Bag of biterms modeling for short texts
2020
Request Book From Autostore
and Choose the Collection Method
Overview
Analyzing texts from social media encounters many challenges due to their unique characteristics of shortness, massiveness, and dynamic. Short texts do not provide enough context information, causing the failure of the traditional statistical models. Furthermore, many applications often face with massive and dynamic short texts, causing various computational challenges to the current batch learning algorithms. This paper presents a novel framework, namely bag of biterms modeling (BBM), for modeling massive, dynamic, and short text collections. BBM comprises of two main ingredients: (1) the concept of bag of biterms (BoB) for representing documents, and (2) a simple way to help statistical models to include BoB. Our framework can be easily deployed for a large class of probabilistic models, and we demonstrate its usefulness with two well-known models: latent Dirichlet allocation (LDA) and hierarchical Dirichlet process (HDP). By exploiting both terms (words) and biterms (pairs of words), the major advantages of BBM are: (1) it enhances the length of the documents and makes the context more coherent by emphasizing the word connotation and co-occurrence via bag of biterms, and (2) it inherits inference and learning algorithms from the primitive to make it straightforward to design online and streaming algorithms for short texts. Extensive experiments suggest that BBM outperforms several state-of-the-art models. We also point out that the BoB representation performs better than the traditional representations (e.g., bag of words, tf-idf) even for normal texts.
Publisher
Springer Nature B.V
This website uses cookies to ensure you get the best experience on our website.