Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

SureshAthanti/Probabilistic_Classification_Model

Open more actions menu

Repository files navigation

Probabilistic_Classification_Model

brief description of Data:

The data consists of a tweet which is text data which has two classes[target] whether it is a real disaster or not disaster. Along with the text we have location and keyword.

data attributes used for classification:

Text: it contains the text of tweet. Target: The target attribute consists of two classes disaster/ not disaster.

Classification task:

predicting whether a given tweet is about a real disaster or not a disaster. we have applied various classification/linear techniques and probability measures for the determining the classification task.

List of steps:

  1. Probabilities and Zipf's Law

    a) Rank, Frequency, and Probability distribution

    b) Probability vs. Rank Plot

    c) Regression line fit

  2. Text Vectorization

  3. Terms and Conditional Probabilities distribution

  4. Classification

    a) Probabilistic Naive Bayes Model

    b) Linear Model

    c) Non-linear Classification

  5. Conlcusions

About

Discussion about Probabilities, Classification and Zipf's law

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
Morty Proxy This is a proxified and sanitized view of the page, visit original site.