Conceptual Representation for Crisis-Related Tweet Classification

Choi, Won-Gyu; Lee, Kyung-Soon; Choi, Won-Gyu; Lee, Kyung-Soon

doi:10.13053/cys-23-4-3304

Serviços Personalizados

Journal

Artigo

Indicadores

Citado por SciELO
Acessos

Links relacionados

Similares em SciELO

Mais
Mais

Permalink

Computación y Sistemas

versão On-line ISSN 2007-9737versão impressa ISSN 1405-5546

Comp. y Sist. vol.23 no.4 Ciudad de México Out./Dez. 2019 Epub 09-Ago-2021

https://doi.org/10.13053/cys-23-4-3304

Articles

Conceptual Representation for Crisis-Related Tweet Classification

Won-Gyu Choi¹

Kyung-Soon Lee¹^*

^¹ Jeonbuk National University, Division of Computer Science and Engineering, RCAIT, Republic of Korea. smtj119@jbnu.ac.kr, selfsolee@jbnu.ac.kr

Abstract

The importance of social media such as Twitter, as a conduit for actionable and tactical information during disasters is increasingly recognized. During crisis situations, rapid and effective response actions by emergency services are critical to assure the safety of the public. In this paper, we propose a conceptual representation for crisis-related tweet classification. In order to classify a stream of tweets related to the incident, the crisis-related terms in each tweet are represented as conceptual entities such as event entities, category indicator entities, information type entities, URL entities, and user entities. For tweet classification, we have compared support vector machines and deep learning model which combines class activation mapping with one-shot learning in convolutional neural networks. Experimental results on TREC 2018 Incident Streams test collection show significant improvement over the baseline system.

Keywords: Conceptual representation; crisis-related tweets classification; incident streams; convolutional neural networks; class activation mapping; one-shot learning; support vector machines

1 Introduction

In recent years, media has become an important source of real-time information. People use social media to communicate, socialize, debate and engage in arguments. With the rapidly increasing amount of data on social networking service (SNS) like Twitter, researches on event detection and classification for the crisis-related tweet data are attracting more and more attention. During a crisis situation, people use SNS to post situational updates, look for useful information, and ask for help ^[¹^,²^]. In these cases, users can help the victims by classifying and understanding the posts as crisis-related tweets.

During natural disasters, SNS such as Twitter and Facebook are widely used by affected people to post crisis-related messages. These crisis-related posts disperse among multiple categories like information wanted about missing, injured and dead people, and infrastructure damage. The challenge here is to classify important situational updates from these crisis-related tweets, assign them appropriate informational categories ^[³^,⁴^].

Related with this challenge, TREC 2018 introduced an Incident Streams (TREC-IS) track which is designed to categorize a stream of tweets related to each incident by information type in order to automatically process social media streams during emergency situations ^[⁵^]. There have been several approaches to classify incident streams by using deep learning models and word representation methods. Chy et al. ^[⁶^] used a method of combining convolutional neural networks (CNNs) and Long Short Term Memory (LSTM). Buntain ^[⁷^] used a pre-trained word vector from Wikipedia and the FastText method. Miyazaki et al. ^[⁸^] used Convolutional Neural Networks (CNNs) model and a simple Multilayer Perceptron (MLP) by inputting text BoW vector.

Deep learning has made major advances in areas such as image processing and natural language processing.

Convolutional neural networks have recently achieved remarkably strong performance on the practically important task of sentence classification ^[⁹^]. In recent image classification research, a technique for generating a class activation map (CAM) is introduced using the global average pooling in CNNs ^[¹⁰^]. A class activation map for a specific class indicates the discriminative image regions which are relevant to that class.

On the other hand, when learning with small number of training data, deep learning does not provide a satisfactory performance. One-shot learning aims to learn information about object categories from a few training images.

In this paper, in order to categorize a stream of tweets related to each incident by information types, we propose a conceptual representation for crisis-related tweet classification. The crisis-related terms in a tweet are represented as conceptual entities such as event entities, category indicator entities, information entities, URL entities, and user entities to represent the specific characteristics of information types in disaster and emergency situation.

We have experimented using deep learning models which combines CAM and one-shot learning model. The CAM is applied to text classification by highly weighting significant term based on class activation mapping using global average pooling instead of fully connected layer. To deal with small number of training data to learn, we have used one-shot learning method using fully connected layer. To show the effectiveness of our method, experiments are conducted on a TREC-IS 2018 test collection.

The paper is organized as follows: Section 2 describes a conceptual representation for crisis-related tweets; Section 3 presents combining deep learning models. Section 4 describes experimental results. Finally, we conclude in Section 5.

2 Conceptual Representation for Crisis-Related Tweets

In order to classify a stream of tweets related to the incident, the terms in each tweet are represented as conceptual entities such as event entities, category indicator entities, information type entities, URL entities, and user entities. The conceptual entities are defined according to the information type categories.

2.1 Event Entities

In TREC-IS training data, hashtags and keywords used for collecting a stream of tweets related to each incident have been provided. For conceptual representation, the hashtags and keywords are represented as event entities: <EventLoc>, <EventCri>, and <EventClu> according to the information in such as a location, an incident and a clue. The followings are examples of event entity representation for each hashtag or keyword.

— <EventLoc>: #colorado wildfire, #abflood, Lax airport, #colorado, #costarica, ...
— <EventCri>: #colorado wildfire, #abflood, #earthquake, #Wildfire, ...
— <EventClu>: #StrongerPH, #prayfornsw, #SafeNow, ...

Since each tweet includes event entities, an event entity itself may not give information. However, it can reduce the noise by the use of diverse terms.

2.2 Category Indicator Entities

In a training data, category indicator terms are provided for each tweet by human assessors. We have used the indicator terms for each category indicator entity which are provided by TREC-IS training data.

Table 1 Examples of category indicator entities

Category Indicator Entity	Examples
<MultimediaShare>	#photo, camera image, videos
<InfomationWanted>	does anyone know, report?
<ServiceAvailable>	heavy air tanker, animal hospital, offering housing
<Official>	tsunami warning issued, briefing, emergency declaration
<Donations>	donations, items needed
<Weather>	#weather, degree, rain
<Movepeople>	evacuate, leave town, evacuation order
<EmergingThreats>	road blocked, sewage treatment disabled, landslide reported

Table 2 Examples of information type entities with expected category

Information Type Entity	Examples	Expected Category
<Disaster-Threat>	Power is out, People trapped, No power	EmergingThreats
<Evacuation>	leave NOW, evacuation orders, closed the highway	MovePeople
<Government>	USGS, LarimerSheriff, Commander	Official
<Organization>	HumaneSociety, RedCrossDenever, NoCORedCross	Official
<Emergency>	be on ALERT, please don't wait	EmergingThreats
<Unit>	magnitude, dollars	Factoid
<Provider>	Red Cross, Animal Hosptal, #Gov20	Volunteer
<Communication>	Call, text, hotline, dm	Official, Advice
<Service>	Animal boarding pick up, disaster recovery center opened	ServiceAvailable
<Change>	Magnitude reduced, update, now cancelled	EmergingThreats

Table 3 Examples of URL entities with expected category

URL Entity	Examples	Expected Category
<URLVideo>	youtube.com, vimeo.com	MultimediaShare
<URLPhoto>	instagram.com, facebook.com/photo.php	MultimediaShare
<URLNews>	yahoo.com/news, cbsnews.com, bbc.co.uk	ContinuingNews, PastNews
<URLWeather>	wunderground.com, theweathernetwork.com	Weather
<URLOrg>	larimer.org, usgs.gov, emergenza24.org	Official
<URLDonation>	worldvision.org.ph, prizeo.com	Donations
<URLDisaster>	sismos.cl tenki.jp	EmergingThreats
<URLBlogMgz>	livejournal.com grist.org	KnownAlready

Table 4 Examples of user entity for expected category

User Entity	Examples	Expected Category
<UserNews>	@9NEWS, @ANCALERTS @BreakingNews,	ContinuingNews, PastNews
<UserWeather>	@dost_pagasa, @breakingstorm	Weather
<UserOrg>	@NSWRFS, @oxfamgb, @NASA	Official
<UserDonation>	@RescuePH	Donations
<UserDisasterInfo>	@NewEarthquake, @INGVterremoti	EmergingThreats
<UserMultimedia>	@youtube	MultimediaShare

The number of categories in TREC-IS and category indicator entities are 25 such as <InformationWanted>, <ServiceAvailable>, <Official>, <Donations>, <EmergingThreats> and etc.

The terms for 5 category indicator entities such as volunteering, donation, multimedia, and sentiment are expanded using terms in thesaurus.com and relatedwords.org.

2.3 Information Type Entities

In order to represent information type for each term, information type entities are defined as followings: <Question>, <Communication>, <Organization>, <Evacuation>, <Location>, <Emergency>, <Person1st>, <Informal>, <Emotion/reaction>, <Narration>, <News>, <Disaster/threat>, <Change>, <Service>, <Provider>, <Fact>, <Unit>, <Government>, <Advice>, <Pastdate>, <Pastverb>.

The seed terms we have defined are expanded using terms in thesaurus.com and relatedwords.org. We have assumed that each information type entity would give additional information for classification. For example, the entities such as <Government> and <Organization> would be helpful for the category 'Official', an entity <News> for the category 'ContinuingNews', an entity <Disaster/threat> for the category 'EmergingThreats', and an entity <Evacuation> for the category 'MovePeople'.

2.4 URL Entities

A URL itself has information as represented by the URL. We have defined URL entities such as <URLVideo>, <URLPhoto>, <URLNews>, <URLWeather>, <URLOrganization>, <URLDonation>, <URLDisaster> and <URLBlogMagazine>.

We have assumed that each URL entity would give information for a category. For example, the entity <URLDonation> would be helpful for the category 'Donations', an entity <URLWeather> for the category 'Weather', an entity <URLOrganization> for the category 'Official', an entity <URLNews> for the categories 'ContinuingNews' or 'PastNews', an entity <URLDisasterInfo> for the category 'EmergingThreats', and the entities <URLVideo> and <URLPhoto> for the category 'MultimediaShare'.

In order to extract useful URLs, the TREC-IS training data and CrisisT26 dataset are used. For pre-processing, a short URL is converted to an original URL. The URLs with more than frequency 10 have been defined as 8 entities. The followings are term lists for each URL entity.

2.5 User Entities

Based on a user's characteristics, we have defined user entities such as <UserNews>, <UserWeather>, <UserOrganization>, <UserDonation>, <UserDisasterInfo>, and <UserMultimedia>.

In order to extract useful user information in tweets, the TREC-IS training data and CrisisT26 dataset are used. The users with more than frequency 10 have been defined as 6 user entities as the characteristics. We have assumed that a user entity <UserDonation> would be helpful for the category 'Donations', a user entity <UserWeather> for the category 'Weather', an entity <UserNews> for the categories 'Factoid', 'ContinuingNews' or 'PastNews', an entity <UserOrganization> for the category 'Official', and an entity <UserMultimedia> for the category 'MultimediaShare'.

3 Combining Deep Learning Models

In image localization, class activation mapping highlights the class-specific discriminative regions ^[¹⁰^]. Class activation mapping can identify the importance of the image regions most relevant to the particular category. We expect that the class activation map for text classification detect the importance of terms.

On the other hand, there has been research using one-shot learning in Long-Short Term Memory. One-shot learning is effective when the number of training data is too small to learn ^[¹¹^,¹²^]. It improves classification accuracy.

We combine CAM with one-shot learning in CNNs as shown in Fig. 1. We expect crisis-related terms are continuously changed through the learning of term weights with high importance.

Fig. 1 Combining deep learning models with CAM and one-shot learning in CNNs

3.1 Class Activation Mapping for Text Classification

In this paper, class activation mapping is applied for text classification. Fig. 2 shows CAM applied for image localization and text classification.

Fig. 2 Class activation mapping for image localization and text classification

The class activation mapping is generated by multiplying the weight from the convolution layer and the weight used for classification using the global average pooling in CNNs. This allows us to check the importance of terms in the input data. Through learning, global average pooling creates class activation mapping and represents terms. The proposed method readjusts the terms weight in order of importance and re-inputs them.

3.2 Features for One-shot Learning

CNNs consists of a convolutional layer, a maximum pooling layer, and a fully connected layer ^[⁹^]. One-shot learning uses CNN's document vectors that come from fully connected layer and category vectors that represent each category. One-shot learning finds a category vector corresponding to the matched category, learns the index of similarity of the document vector and sets the value to 1. In the other case of finding, the category vector corresponding to the unmatched category, it sets the value of index of similarity to zero.

4 Experiments

4.1 Test Collection

In TREC Incident Streams (TREC-IS) 2018 test collection, a stream of tweets is collected by 19 events from the CrisisLex or CrisisNLP initiatives using hashtags and keyword monitoring ^[¹⁴^,¹⁵^]. The tweets are related to the incidents of six different types, e.g. earthquakes, hurricanes, public or shootings ^[¹³^]. The training set contains six event topics such as 2012 Colorado wildfires, 2012 Costa Rica Earthquake, 2013 Colorado floods, 2012 Typhoon Pablo, 2013 LA Airport Shooting, and 2013 West Texas Explosion. The test set contains 15 event topics such as 2012 Guatemala earthquake, 2012 Philippines floods, 2013 Australia bushfire, 2013 Boston bombings, 2011 Joplin tornado, 2015 Paris attacks and etc.

The TREC-IS task is classifying the tweets into 25 information types. The information type categories, training data and test data for each category are shown in Table 6. The number of tweets in a training set is 1335, whereas the number of tweets in a test set is 22160.

Since the human assessors were allowed to select as many information types when creating the ground truth, the performance is measured in two ways: any-type and multi-type. In any-type evaluation, a system receives a full score for a tweet if it assigned any of the categories that the human assessor selected for that tweet. In the multi-type evaluation, the performance per information type is calculated in a 1 vs. all manner ^[¹³^]. As evaluation metrics, Precision (P), Recall (R), F1 score (F1), and Accuracy (A) are measured.

4.2 Comparison Experiments

In order to see the effectiveness of the proposed conceptual representation, we have compared our method with baseline term representation for two machine learning methods: support vector machines (SVM) ^[¹⁶^] and combining deep learning models:

— baseSVM: SVM with term representation.
— baseComb: Combining deep learning models with term representation.
— conceptSVM: SVM with conceptual representation.
— conceptComb: Combining deep learning models with conceptual representation.

We have compared our method with TREC-IS 2018 track participants with high performance:

— KDEIS4_DM: CNNs with bidirectional LSTM ^[⁶^].
— umdhcilfasttext: FastText using pre-trained word vector datasets on Wikipedia ^[⁷^].
— NHK: Simple Multilayer perceptron by inputting text BoW vector ^[⁸^].

4.3 Experimental Results

Table 5 shows the experimental results of information type categorization for any-type.

Table 5 Experimental results for information type

Method	F₁	Accuracy
baseSVM	0.555	0.402
baseComb	0.497	0.340
conceptSVM	0.575	0.421
conceptComb	0.533	0.389
KDEIS4_DM	0.560	0.391
Umdhcilfasttext	0.558	0.402
NHK	0.551	0.400

The proposed conceptual representation achieved significant improvement in SVM and deep learning models compared to the baseline representation. The proposed method showed high performance compared to other participant methods.

In this experiments, SVM showed better performance than deep learning models. It is due to the small number of a training data for each category even though we adopted one-shot learning method. Table 6 shows results per each information type performance for multi-type.

Table 2 Information type categorization (multi-type) per information type performance in F₁ measure

Information Type Category		Train	Test	baseSVM	conceptSVM	baseComb	conceptComb
Information Type Category		data	data	baseSVM	conceptSVM	baseComb	conceptComb
Request	GoodsServices	0	126	0	0	0	0
	SearchAndRescue	0	286	0	0	0	0
	InformationWanted	10	172	0.03	0.02	0	0
CallToAction	Volunteer	2	116	0	0	0	0
	Donations	15	804	0.46	0.47	0.09	0.53
	MovePeople	26	27	0.06	0.07	0.05	0.04
Report	FirstPartyObservation	28	3807	0.01	0.01	0	0
	ThirdPartyObservation	15	4160	0	0	0	0
	Weather	42	1325	0.15	0.25	0.12	0.24
	EmergingThreats	36	686	0.02	0.03	0	0.01
	SignificantEventChange	34	415	0.01	0.01	0	0.01
	MultimediaShare	127	3974	0.36	0.43	0.19	0.34
	ServiceAvailable	15	1076	0.05	0.15	0.01	0.06
	Factoid	140	2383	0.23	0.25	0.15	0.21
	Official	52	403	0.08	0.08	0.06	0.10
	CleanUp	2	62	0	0	0	0
	Hashtags	4	3363	0	0	0	0
Other	PastNews	12	1351	0	0.01	0.03	0
	ContinuingNews	251	4871	0.41	0.36	0.36	0.36
	Advice	39	1209	0.10	0.08	0.07	0.09
	Sentiment	132	6952	0.52	0.53	0.52	0.55
	Discussion	51	2060	0.08	0.07	0.06	0.04
	Irrelevant	163	2605	0.20	0.22	0.12	0.22
	Unknown	26	77	0	0	0.02	0.01
	KnownAlready	113	1101	0.10	0.11	0.08	0.11

5 Conclusions

In this paper, we proposed conceptual representation for terms, URLs and users in a tweet in order to classify a stream of tweets related to the incident. The crisis-related terms were represented as conceptual entities such as events, category indicators, information types, URLs, and user entities. We have assumed that each conceptual entity would give additional information for information type categorization. Experimental results showed the conceptual representation was effective for crisis-related tweet classification with SVM and CNNs.

Acknowledgements

This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education(NRF-2017R1D1A1B03036275).

References

1. Vieweg, S., Castillo, C., & Imran, M. (2014). Integrating Social Media Communications into the Rapid Assessment of Sudden Onset Disasters. Proceedings of International Conference on Social Informatics, pp. 444-461. DOI: 10.1007/978-3-319-13734-6_32. [ Links ]

2. Rudra, K., Banerjee, S., Ganguly, N., Goya, l.P., Imran, M., & Mitra, P. (2016). Summarizing situational tweets in crisis scenario. Proceedings of the 27th ACM Conference on Hypertext and Social Media, pp. 137-147. DOI: 10.1145/2914586. 2914600. [ Links ]

3. Nguyen, D.T., Al-Mannai, K., Joty, S., Sajjad, H., Imran, M., & Mitra, P. (2017). Robust Classification of Crisis-Related Data on Social Networks Using Convolutional Neural Networks. Proceedings of the Eleventh International AAAI Conference on Web and Social Media (ICWSM), pp. 632-635. [ Links ]

4. Rudra, K., Banerjee, S., Ganguly, N., Goyal, P., Imran, M., & Mitra, P. (2016). Summarizing Situational Tweets in Crisis Scenario. Proceedings of the 27th ACM Conference on Hypertext and Social Media, pp.137-147. DOI: 10.1145/2914586.2914600. [ Links ]

5. McCreadie, R., Buntain, C., & Soboroff, I. (2018). TREC 2018 Incident Streams Track Guidelines. http://dcs.gla.ac.uk/~richardm/TREC_IS. [ Links ]

6. Chy, A.N., Siddiqua, U.A., & Aono, M. (2018). Neural Networks and Support Vector Machine based Approach for Classifying Tweets by Information Types at (TREC'18) Incident Streams Task. Proceedings of the 26th Text Retrieval Conference, pp. 1 -7. [ Links ]

7. Buntain, C. (2018). Learning Information Types in Social Media for Crises. Proceedings of the 26th Text Retrieval Conference. [ Links ]

8. Miyazaki, T., Makino, K., Takei, Y., Okamoto, H., & Goto, J. (2018). NHK STRL at TREC 2018 Incident Streams track. Proceedings of the 26th Text Retrieval Conference. pp. 1 -7. [ Links ]

9. Kim, Y. (2014). Convolutional neural networks for sentence classification. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1746-1751. [ Links ]

10. Zhou, B., Khosla, A., Lapedriza, A., Oliva, A., & Torralba, A. (2016). Learning Deep Features for Discriminative Localization. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2921 -2929. [ Links ]

11. Vinyals, O., Blundell, C., Lillicrap, T., & Wierstra, D. (2016). Matching networks for one shot learning. Advances in Neural Information Processing Systems, pp. 3630-3638. [ Links ]

12. Akata, Z., Perronnin, F., Harchaoui, Z., & Schmid, C. (2015). Label-embedding for image classification. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 38, No. 7, pp. 1425-1438. [ Links ]

13. McCreadie, R. & Soboroff, I. (2018). TREC 2018 Incident Streams Track Overview Paper. Proceedings of the 26th Text Retrieval Conference. [ Links ]

14. Olteanu, A., Castillo, C., Diaz, F., & Vieweg, S. (2014). CrisisLex: A Lexicon for Collecting and Filtering Microblogged Communications in Crises. Proceedings of the Eighth International AAAI Conference on Weblogs and Social Media (ICWSM'14), pp.376-385. [ Links ]

15. Imran, M., Mitra, P., & Castillo, C. (2016). Twitter as a Lifeline: Human-annotated Twitter Corpora for NLP of Crisis-related Messages. Proceedings of the 10th Language Resources and Evaluation Conference, pp. 1638-1643. [ Links ]

16. Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., & Duchesnay, E. (2011). Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research (JMLR), pp. 2825-2830. [ Links ]

Received: December 21, 2018; Accepted: March 05, 2019

^* Corresponding author is Kyung-Soon Lee. selfsolee@jbnu.ac.kr

This is an open-access article distributed under the terms of the Creative Commons Attribution License