Table of Contents

How To Read Effectively

Teaching myself and making an algorithm to assist me in reading effectively. I have not done it much up to this point, so it's okay to fail / be slow!

Video Playback and Annotation

Mozilla HTML5 video documentation, YouTube API. Listen on Repeat is nice, but you can't send a link to a repeat?!?!

=== Sublime Video === (no adjustable playback speed)

Youtube base API

Looks good so far. However, how to add support for variable speed playback?

Markup

I really like Mortimer Adler's suggestions in How To Mark A Book. Underline major statements. Vertical lines at margins for details.

And some lines and arrows to connect the main ideas together in one big mind map.

From Notebook

Random

10/21/15 Notes from Deep Learning Talk

Slides

Word embeddings, word2vec

Look up CoreNLP (the Stanford group)

Deep learning works really well for fuzzy definitions that are hard to pin down in normal ways.

Figure out what you want to do first!

n-grams allow you to capture the nearby context of the word. E. Coli would be one n-gram instead of separate words.

Business
Word2Vec

arxiv.org, he checks it weekly! Is there a way to get to know the *good papers* from there? Federated wiki might allow you to make meta-notes, instead of assuming commenting allowed on that website.

Uses word embedding space to do deep learning on. But it's only using 1 word, why not n-gram?

Read Yav Goldberg natural language processing paper, best overview paper in his opinion.

doc2vec, sentence2vec

Other notes

Hypothes.is seems like most similar project. Another is PeerLibrary.

Xlibris is the best one I've seen yet. Main site, and they did great investigation in how to customize best for legal research.

Liquid Software guy based on Engelbart's work. His Author app more to come on his twitter in November 2014 has a lot of similar features, but it should be on the web! (not just OS X). Also, did an excellent Engelbart documentary.

Draft is another similar thing to this project, but not that similar.

Storage Format

EPUB

Support for Table of Contents arguably not necessary as we will auto-generate it from the text

Active Reading Assistant

Like most of the other research tools out there, except hopefully easier/better.

Assist with de-constructing a book into its “skeleton”/outline format, assisted with your notes and observations. And…maybe even improved with other people's advice.

j/k = up/down   = scroll up and down to next sentence
h = left        = zoom out to next biggest outline
l = right       = zoom in to think about idea more. Open area on right more?

e = del = archive sentence (unnecessary detail). make it gray at next to bottom zoom level. Hide it at higher levels.

Coloring

Already read sentences dim slightly

Text highlighting

Good

Think about more

Force summaries to check understanding?

Resources

Recognizing Hand-Drawn Shapes

GET RID OF SEARCHING FOR MENU OPTIONS. SEARCH FOR THE COMMAND, PRESS THE MOUSE, '?' for KEYBOARD HINTS,etc.

Others

“Document Management Service”, NetDocuments.

Paste Buffer

Paste buffer in Firefox/Chrome should use Web Events paste and access Clipboard. Cross-browser example in Pasteboard (but only does images, and doesn't work in IE?).

IE and Chrome work well enough with test file in project. Firefox, something isn't firing right.

NLTK Book

NLTK Book

Chapter 0

They choose to balance theory and practice.

After completing these materials, then students will be ready to attempt one of the more advanced textbooks, such as Speech and Language Processing by Jurafsky and Martin

People Resources

Parts of Arguments

Name Example Tip-off
Claim The moon is made of cheese is/was
Reason Because
Support
Warrant

Implementation

Stuff about synonym tables, stemming, etc. Sounds cool, not sure what they mean quite yet

Solr LucidWorks Fusion Notes

Incoporate OpenNLP for named entity recognition (people, places, companies). Probably also “he, it, she”, etc.

Lucene has “More like this” functionality

Also, “SpanQuery”, only return the part of document that is relevant

BoostingTermQuery changes weights of terms in query

Caching common searches and showing the “right document” as best result is a nice hack.

Teaching a Computer

recognizes logic, reasoning, fallacies of logic, presents a contrasting view. “Semantic ” ?

Style of Writing

It'd be nice to figure out what makes a good explanation and make sure that you read books with good explanations first.

  1. Information Retrieval, particularly with fact checking and material understanding. Free Recommended IR Book
    1. Automatedly find out facts and explanations of things, particularly from textbooks.
      1. WebMD isn't so great because they just give you the dumbed-down instructions to follow. Why should I wash for 20 seconds? Where are the most important places for me to wash? Someone that knows what they're talking about is here, but doesn't explain the counter-intuitive scientific background.
      2. Maybe there is a heuristic like how many times they say “because”, “as”, “therefore”, or something. Conclusion statements.
      3. “From first principles”, “lucid”, Calculus, An Intuitive and Physical Approach
    2. Quora: How Can ML help solve problems in Journalism?, great summary of recent advances and the vision of the field. Really makes me want to do a PhD on it or work for Google :-).
      1. Another view: “nothing will really change unless we value citing reputable sources over sensational journalism” <and teach people how to read critically>.
    3. If you want to work on it more, here's a few classes. Google it! Theoretical and more practical/hands-on Stanford CS276 and a Wikipedia article: http://en.wikipedia.org/wiki/Automatic_summarization
    4. I want to be able to do syntopical reading and have the computer help me do that. “What does the Bible think about dating” and have it come up with the superclasses and synonyms of dating and provide all the relevant passages associated with it.

Comments on the Internet

TLDR: Find a person you can trust and email them / search their blog or wiki directly :)

Smart Search Engine

Sentiment Analysis

Train using Amazon reviews and the stars given. More tips on edge cases by pulling from Google papers and patents. It would be helpful to search the Google search results for Brian Klemmer (and other life coaches like him) and understand dissenting opinions and facts.

Extract Keywords from Text