Please enable JavaScript.

Coggle requires JavaScript to display documents.

2016 - Detecting sarcasm in customer tweets: an NLP based approach…

- - - - Function words
        
        The words that have little or no significant meaning outside the premise of the sentence.
      - part of speech n-grams
      - Various combinations
        
        Content words + function words
        
        We have also extracted both these feature types. together and used as a single feature type to capture both style- and topic-based features.
        
        Function words +
        part of speech
        n-grams
        
        We have combined function words and part of speech n-grams and used them as a single feature for classification. These features exclusively capture style-based features
        
        Content words + function words +
        part of speech n-grams
        
        Using this feature type we have tried to capture both style-based as well as topic-based features.
      - Part of speech tags
  - - - We downloaded around 15,000 tweets using hashtags such as #sarcasm, #sarcastic along with sincere tweets using R software
      - To train and test the classifiers, the data were split into two sets randomly. The data set was divided into a ratio of 3:1. The mentioned ratio has been extensively applied in classification literature (Schürer and Muskal, 2013).
      - A tenfold cross-validation was performed on the training set. In choosing the training testing ratio, the stress is on generalizability of the results which is achieved by the K-fold cross-validation as explained later in this section (Domingos, 2012)
    - - One needs to ensure that the training data does not overfit the training set as it could drastically distort the result for the test set. This is usually addressed by the K-fold cross-validation.
    - - Type of classficiation in NLP
        
        Generative
        
        Learn the joint probability of the inputs and the labels (classes like in our case sarcasm/non-sarcasm), and make the prediction by using the Bayes rule to select the most likely label
        
        Discriminative
        
        Model the posterior probability directly or learn a direct map of inputs to the class label (Ng and Jordan, 2002)
      - Classification Model
        
        Naïve Bayesian Classifier
        
        We considered a document vector model (Manning and Schutze, 1999) for representing a document with the help of terms which can be used as inputs.
        
        Maximum entropy classifier
        
        Unlike the Naïve Bayes classifier, the maximum entropy classifier does not assume that the features are conditionally independent of each other. Maximum entropy is therefore a less restrictive model than Naïve Bayesian model
        
        It is based on the principle of maximum entropy and from all the models which fit the training data, it selects the one which has the highest entropy.
        
        The maximum entropy classifier requires more time to train compared to Naïve Bayes due to the optimization problem that needs to be solved in order to estimate the parameters of the model.
      - We have formulated both the types of classification models, the Naïve Bayes model (generative classifier) and the maximum entropy model (discriminative classifier).
- - - - This makes sarcasm detection from unstructured text data a relevant and challenging problem. This is also because it is unaided by any visual or vocal cues that assist humans in understanding sarcasm.
      - One of the major issues in sarcasm detection is the absence of naturally occurring expressions that can be used for training purposes (Davidov et al., 2010).