Javascript must be enabled to continue!
Uncertainty Detection in Unstructured Big Data
View through CrossRef
It's a world of full of information. Data is one of the important element of this era. One of the major sources of data is social media platforms like Twitter, Facebook etc. Everyday social media generates lot of data. It's a free form of communication where people communicate with each other without any restriction. Users can post anything on social media. However, human tendency of speaking about non-existing or source-less thing makes it unclear or create unuseful information and becomes unreliable information. This casual and word-of mouth form of communication leads to generate uncertain data and quality of information from factuality point of view becomes primary concern in social media. So, uncertainty detection is important in social media. Uncertainty detection in natural language text becomes challenging because dealing with natural language text is pretty complicated thing. Uncertainty is an important field of linguistics. Basically, it means “lack of information”. So, a statement whose truth value cannot be determined is considered as uncertain. Linguistics and Natural Language Processing aims at classifying factual and uncertain proposition. In this chapter, comparisons of classification algorithm have been experimented to detect uncertain propositions in Twitter data of food price crisis. Comparative analysis of classification algorithm Naive Bayes and Support Vector Machine approach is done to detect uncertain propositions of tweets related to food price crisis. A model is trained to classify certain or uncertain proposition using a training file which is annotated using cue words available in English language text. Output of algorithm is the class showing given proposition is certain or uncertain. The objective of this chapter is to have a comparative analysis of text classification approach to detect uncertain events of Twitter data of food price crisis and to improve the accuracy of uncertainty classification approaches in order to detect uncertain events in natural language processing.
Title: Uncertainty Detection in Unstructured Big Data
Description:
It's a world of full of information.
Data is one of the important element of this era.
One of the major sources of data is social media platforms like Twitter, Facebook etc.
Everyday social media generates lot of data.
It's a free form of communication where people communicate with each other without any restriction.
Users can post anything on social media.
However, human tendency of speaking about non-existing or source-less thing makes it unclear or create unuseful information and becomes unreliable information.
This casual and word-of mouth form of communication leads to generate uncertain data and quality of information from factuality point of view becomes primary concern in social media.
So, uncertainty detection is important in social media.
Uncertainty detection in natural language text becomes challenging because dealing with natural language text is pretty complicated thing.
Uncertainty is an important field of linguistics.
Basically, it means “lack of information”.
So, a statement whose truth value cannot be determined is considered as uncertain.
Linguistics and Natural Language Processing aims at classifying factual and uncertain proposition.
In this chapter, comparisons of classification algorithm have been experimented to detect uncertain propositions in Twitter data of food price crisis.
Comparative analysis of classification algorithm Naive Bayes and Support Vector Machine approach is done to detect uncertain propositions of tweets related to food price crisis.
A model is trained to classify certain or uncertain proposition using a training file which is annotated using cue words available in English language text.
Output of algorithm is the class showing given proposition is certain or uncertain.
The objective of this chapter is to have a comparative analysis of text classification approach to detect uncertain events of Twitter data of food price crisis and to improve the accuracy of uncertainty classification approaches in order to detect uncertain events in natural language processing.
Related Results
New Perspectives for 3D Visualization of Dynamic Reservoir Uncertainty
New Perspectives for 3D Visualization of Dynamic Reservoir Uncertainty
This reference is for an abstract only. A full paper was not submitted for this conference.
Abstract
1 Int...
Reserves Uncertainty Calculation Accounting for Parameter Uncertainty
Reserves Uncertainty Calculation Accounting for Parameter Uncertainty
Abstract
An important goal of geostatistical modeling is to assess output uncertainty after processing realizations through a transfer function, in particular, to...
The uncertainty–investment relationship: scrutinizing the role of firm size
The uncertainty–investment relationship: scrutinizing the role of firm size
PurposeThe objective of this paper is threefold. First, it aims to empirically study whether firm-specific/idiosyncratic uncertainty, macroeconomic/aggregate uncertainty and politi...
Structured Codes and Free-Text Notes: Measuring Information Complementarity in Electronic Health Records
Structured Codes and Free-Text Notes: Measuring Information Complementarity in Electronic Health Records
ABSTRACT
Background
Electronic health records (EHRs) consist of both structured data (e.g., diagnostic codes) and unstructured ...
Sampling Space of Uncertainty Through Stochastic Modelling of Geological Facies
Sampling Space of Uncertainty Through Stochastic Modelling of Geological Facies
Abstract
The way the space of uncertainty should be sampled from reservoir models is an essential point for discussion that can have a major impact on the assessm...
Big Data : Analysis
Big Data : Analysis
The amount of data in world is growing day by day. Data is growing because of use of internet, smart phone and social network. Big data is a collection of data sets which is very l...
Contributions to uncertainty in projections of future drought under climate change scenarios
Contributions to uncertainty in projections of future drought under climate change scenarios
Abstract. Drought is a cumulative event, often difficult to define and involving wide reaching consequences for agriculture, ecosystems, water availability, and society. Understand...
Big Data and Official Statistics
Big Data and Official Statistics
Big data is a component of the Fourth Industrial Revolution. The deep penetration of digital technology has turned data into an essential component of the production process. Data ...

