Grokking algorithms 3
Chapter 11: Where to go next
RECAP
Trees
A binary search tree: For every node, the nodes to its left are smaller in value, and the nodes to the right are larger in value
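A minimal sketch of binary-search-tree lookup in Python (the tree and its values are made up for illustration):
```python
class Node:
    def __init__(self, value, left=None, right=None):
        self.value, self.left, self.right = value, left, right

def search(node, target):
    # In a BST, everything smaller is to the left, everything larger to the right.
    while node is not None:
        if target == node.value:
            return True
        node = node.left if target < node.value else node.right
    return False

#      10
#     /  \
#    5    15
root = Node(10, Node(5), Node(15))
print(search(root, 5))   # True
print(search(root, 7))   # False
```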
If you’re interested in databases or more-advanced data structures, check these out:
• B-trees
• Red-black trees
• Heaps
• Splay trees
Inverted indexes
A hash that maps words to places where they appear. This data structure is called an inverted index. It's commonly used to build search engines.
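A minimal inverted-index sketch in Python (the pages and their contents are made up for illustration):
```python
# Map each word to the set of pages it appears on.
pages = {
    "page1": "hi there",
    "page2": "hi adit",
    "page3": "there we go",
}

inverted_index = {}
for page, text in pages.items():
    for word in text.split():
        # setdefault creates an empty set the first time a word is seen
        inverted_index.setdefault(word, set()).add(page)

# A search engine can now look up a word directly instead of scanning pages.
print(inverted_index["hi"])     # {'page1', 'page2'}
print(inverted_index["there"])  # {'page1', 'page3'}
```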
The Fourier transform
Given a smoothie, the Fourier transform will tell you the ingredients in the smoothie.
The Fourier transform is great for processing signals.
You can also use it to compress music, try to predict upcoming earthquakes, and analyze DNA.
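A small sketch of the smoothie idea using NumPy's FFT (the two frequencies here are made-up "ingredients"):
```python
import numpy as np

# Build a signal that mixes two "ingredients": a 3 Hz and a 7 Hz sine wave.
t = np.linspace(0, 1, 1000, endpoint=False)
signal = np.sin(2 * np.pi * 3 * t) + 0.5 * np.sin(2 * np.pi * 7 * t)

# The Fourier transform tells us which frequencies the signal contains.
spectrum = np.abs(np.fft.rfft(signal))
freqs = np.fft.rfftfreq(len(signal), d=t[1] - t[0])

# The two largest peaks sit at the ingredient frequencies.
top = freqs[np.argsort(spectrum)[-2:]]
print(sorted(top.tolist()))  # [3.0, 7.0]
```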
MapReduce
why are distributed algorithms useful?
The map function
The reduce function
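A quick sketch of both functions in Python (doubling and summing a small list are standard illustrations):
```python
from functools import reduce

arr = [1, 2, 3, 4, 5]

# map applies a function to every element independently — easy to distribute
# across many machines, since the items don't depend on each other.
doubled = list(map(lambda x: 2 * x, arr))  # [2, 4, 6, 8, 10]

# reduce combines a whole list down into a single value.
total = reduce(lambda a, b: a + b, arr)    # 15

print(doubled, total)
```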
Bloom filters and HyperLogLog
Bloom filters
HyperLogLog
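A toy Bloom filter sketch in Python. The size, number of hashes, and the trick of salting SHA-256 with an index to get several hash functions are all arbitrary illustration choices, not a production design:
```python
import hashlib

class BloomFilter:
    """Probabilistic set membership: false positives are possible,
    false negatives are not."""

    def __init__(self, size=1000, num_hashes=3):
        self.size = size
        self.num_hashes = num_hashes
        self.bits = [False] * size

    def _positions(self, item):
        # Derive several bit positions by salting one hash function.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode()).hexdigest()
            yield int(digest, 16) % self.size

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos] = True

    def might_contain(self, item):
        return all(self.bits[pos] for pos in self._positions(item))

bf = BloomFilter()
bf.add("dmoz.org")
print(bf.might_contain("dmoz.org"))     # True
print(bf.might_contain("example.com"))  # False (probably)
```
The filter can say "probably yes" when the real answer is no, but it never says no when the answer is yes, and it uses far less space than storing every item.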
Parallel algorithms
The SHA algorithms
Comparing files
Checking passwords
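A sketch of both uses with Python's hashlib. (Note: real password storage should use a salted, slow hash such as bcrypt; plain SHA-256 and the made-up password here only illustrate the idea.)
```python
import hashlib

# Comparing files: if two files have the same SHA-256 hash,
# their contents are (almost certainly) identical.
def file_hash(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()

print(file_hash(b"hello world") == file_hash(b"hello world"))  # True
print(file_hash(b"hello world") == file_hash(b"hello w0rld"))  # False

# Checking passwords: store only the hash, never the password itself.
stored_hash = hashlib.sha256(b"hunter2").hexdigest()  # made-up password
attempt = b"hunter2"
print(hashlib.sha256(attempt).hexdigest() == stored_hash)  # True
```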
Locality-sensitive hashing
Diffie-Hellman key exchange
Linear programming
Epilogue
Chapter 9: Dynamic programming
The knapsack problem
Dynamic programming
Dynamic programming starts by solving subproblems and builds up to solving the big problem.
Every dynamic-programming algorithm starts with a grid
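A grid-filling sketch in Python, using the chapter's guitar ($1500, 1 lb), stereo ($3000, 4 lb), and laptop ($2000, 3 lb) example with a 4 lb knapsack:
```python
def knapsack(items, capacity):
    """One row per item, one column per sub-capacity from 0 to `capacity`
    (whole pounds). Each cell holds the best value so far."""
    grid = [[0] * (capacity + 1) for _ in items]
    for i, (name, weight, value) in enumerate(items):
        for c in range(1, capacity + 1):
            best_without = grid[i - 1][c] if i > 0 else 0
            if weight <= c:
                # Either skip this item, or take it plus the best
                # fill of the remaining space from the row above.
                rest = grid[i - 1][c - weight] if i > 0 else 0
                grid[i][c] = max(best_without, value + rest)
            else:
                grid[i][c] = best_without
    return grid[-1][capacity]

items = [("guitar", 1, 1500), ("stereo", 4, 3000), ("laptop", 3, 2000)]
print(knapsack(items, 4))  # 3500 (guitar + laptop)
```
Each cell only looks at the row above it, which is why (as the FAQ below notes) changing the order of the rows doesn't change the answer.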
Knapsack problem FAQ
What happens if you add an item?
What happens if you change the order of the rows?
The answer doesn’t change. The order of the rows doesn’t matter.
Can you fill in the grid columns-wise instead of row-wise?
What happens if you add a smaller item?
Can you steal fractions of an item?
You can't. With the dynamic-programming solution, you either take the item or not. There's no way for it to figure out that you should take half an item.
Longest common substring
Making the grid
Filling in the grid
The solution
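A Python sketch of the substring grid, using the chapter's hish/fish and hish/vista comparisons:
```python
def longest_common_substring(a, b):
    # cell[i][j] = length of the common substring ending at a[i] and b[j]
    cell = [[0] * len(b) for _ in a]
    best = 0
    for i, ca in enumerate(a):
        for j, cb in enumerate(b):
            if ca == cb:
                # Extend the streak from the cell diagonally up-left.
                above_left = cell[i - 1][j - 1] if i > 0 and j > 0 else 0
                cell[i][j] = above_left + 1
                best = max(best, cell[i][j])
            # else: the cell stays 0 — the streak is broken
    return best

print(longest_common_substring("hish", "fish"))   # 3 ("ish")
print(longest_common_substring("hish", "vista"))  # 2 ("is")
```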
Longest common subsequence
Longest common subsequence: solution
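A Python sketch of the subsequence grid; the only change from the substring version is what happens when the letters don't match:
```python
def longest_common_subsequence(a, b):
    # cell[i][j] = length of the LCS of a[:i] and b[:j]
    cell = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                cell[i][j] = cell[i - 1][j - 1] + 1
            else:
                # Letters differ: carry over the best seen so far.
                cell[i][j] = max(cell[i - 1][j], cell[i][j - 1])
    return cell[len(a)][len(b)]

print(longest_common_subsequence("fosh", "fish"))  # 3 ("fsh")
print(longest_common_subsequence("fosh", "fort"))  # 2 ("fo")
```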
RECAP
• Dynamic programming is useful when you’re trying to optimize something given a constraint.
• You can use dynamic programming when the problem can be broken into discrete subproblems.
• Every dynamic-programming solution involves a grid.
• The values in the cells are usually what you’re trying to optimize.
• Each cell is a subproblem, so think about how you can divide your problem into subproblems.
• There’s no single formula for calculating a dynamic-programming solution.
Chapter 10: K-nearest neighbors
Classifying oranges vs. grapefruit
If you're trying to classify something, you might want to try KNN first.
More neighbors are oranges than grapefruit, so this fruit is probably an orange.
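A minimal KNN classification sketch in Python (the size/redness numbers are made up for illustration):
```python
import math

def distance(a, b):
    # Straight-line (Euclidean) distance between two feature vectors.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def classify_knn(fruits, new_fruit, k=3):
    # fruits: list of ((size, redness), label) pairs.
    neighbors = sorted(fruits, key=lambda f: distance(f[0], new_fruit))[:k]
    labels = [label for _, label in neighbors]
    return max(set(labels), key=labels.count)  # majority vote

fruits = [
    ((6, 3), "orange"), ((7, 4), "orange"), ((6, 4), "orange"),
    ((9, 8), "grapefruit"), ((10, 9), "grapefruit"),
]
print(classify_knn(fruits, (7, 3)))  # orange
```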
building a recommendations system
Feature extraction
In the grapefruit example, you compared fruit based on how big they are and how red they are. Size and color are the features you're comparing.
Regression
• Classification = categorization into a group
• Regression = predicting a response (like a number)
You're trying to predict how many loaves to make for today. You have a set of features:
• Weather on a scale of 1 to 5 (1 = bad, 5 = great).
• Weekend or holiday? (1 if it’s a weekend or a holiday, 0 otherwise.)
• Is there a game on? (1 if yes, 0 if no.)
Today is a weekend day with good weather. Days a, b, d, and e are the closest to today.
Take an average of the loaves sold on those days, and you get 218.75.
That’s how many loaves you should make for today!
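A regression sketch in Python, with day features chosen to reproduce the 218.75 from the notes (the exact feature values and sales figures are assumptions for illustration):
```python
import math

def distance(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

# (weather 1-5, weekend/holiday?, game on?) -> loaves sold that day
days = {
    "a": ((5, 1, 0), 300), "b": ((3, 1, 1), 225), "c": ((1, 1, 0), 75),
    "d": ((4, 0, 1), 200), "e": ((4, 0, 0), 150), "f": ((2, 0, 0), 50),
}

today = (4, 1, 0)  # good weather, weekend, no game

# Regression: take the 4 nearest days and average their sales.
nearest = sorted(days.values(), key=lambda d: distance(d[0], today))[:4]
print(sum(sales for _, sales in nearest) / len(nearest))  # 218.75
```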
Picking good features
Picking the right features means:
• Features that directly correlate to the movies you're trying to recommend
• Features that don’t have a bias (for example, if you ask the users to only rate comedy movies, that doesn’t tell you whether they like action movies)
Introduction to machine learning
OCR
OCR stands for optical character recognition. It means you can take a photo of a page of text, and your computer will automatically read the text for you
You can use KNN for this:
Go through a lot of images of numbers, and extract features of those numbers.
When you get a new image, extract the features of that image, and see what its nearest neighbors are!
The first step of OCR, where you go through images of numbers and extract features, is called training
Building a spam filter
Spam filters use another simple algorithm called the Naive Bayes classifier.
Suppose you get an email with the subject “collect your million dollars now!” Is it spam?
You can break this sentence into words. Then, for each word, see what the probability is for that word to show up in a spam email
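A toy Naive Bayes sketch in Python (all the word probabilities and priors are made up; a real filter would estimate them from a corpus of labeled emails):
```python
# P(word | spam) and P(word | not spam) — invented numbers for illustration.
p_word_given_spam = {"collect": 0.30, "your": 0.10, "million": 0.40,
                     "dollars": 0.35, "now": 0.15}
p_word_given_ham = {"collect": 0.01, "your": 0.10, "million": 0.01,
                    "dollars": 0.02, "now": 0.05}
p_spam, p_ham = 0.5, 0.5  # prior probabilities (assumed equal)

subject = "collect your million dollars now"

# "Naive" assumption: words are independent, so probabilities multiply.
score_spam, score_ham = p_spam, p_ham
for word in subject.split():
    score_spam *= p_word_given_spam.get(word, 0.01)
    score_ham *= p_word_given_ham.get(word, 0.01)

print("spam" if score_spam > score_ham else "not spam")  # spam
```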
Predicting the stock market
Predicting the future is hard, and it's almost impossible when there are so many variables involved.
RECAP
• KNN is used for classification and regression and involves looking at the k-nearest neighbors.
• Classification = categorization into a group.
• Regression = predicting a response (like a number).
• Feature extraction means converting an item (like a fruit or a user) into a list of numbers that can be compared.
• Picking good features is an important part of a successful KNN algorithm.