January 22, 2010
Posted in Uncategorized | Leave a Comment »
January 20, 2010
December 26, 2009
-
On whether or not Wallace himself is actually an anti-rebel, Scott concludes that he "is less anti-ironic than (forgive me) meta-ironic. That is, his gambit is to turn irony back on itself, to make his fiction relentlessly conscious of its own self-consciousness, and thus to produce work that will be at once unassailably sophisticated and doggedly down to earth. Janus-faced, he demands to be taken at face value. 'Single-entendre principles' is a cleverly tossed off phrase, but Wallace is temperamentally committed to multiplicity—to a quality he has called, with reference to the filmmaker David Lynch, 'bothness.'"
November 25, 2009
-
OCRopus(tm) is a state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multi-lingual capabilities.
November 18, 2009
-
Prof. Goel is helping Twitter with its reputation system. | Research interests: Methodological: Algorithms, optimization, stochastics, graph theory. Applications: Network and Internet algorithms; molecular algorithms; Internet commerce and social networks.
November 9, 2009
-
Wikipedia turns out to have a distinct division into communities. These communities tend to contain semantically similar articles. Within such a community, the highest PageRank score typically goes to the article whose title names the topic of the whole community.
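To make the PageRank claim concrete, here is a minimal sketch of power-iteration PageRank run on a single toy community. The link graph, the article titles, the damping factor of 0.85, and the function name `pagerank` are all assumptions made for illustration; none of them come from the study quoted above.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict mapping article -> list of linked articles."""
    # Collect every article that appears as a source or as a link target.
    nodes = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iterations):
        # Every article receives the same "random jump" share each round.
        new_rank = {v: (1.0 - damping) / n for v in nodes}
        for v in nodes:
            targets = links.get(v, [])
            if targets:
                # Split this article's damped rank evenly among its outgoing links.
                share = damping * rank[v] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # Dangling article (no outgoing links): spread its rank over all articles.
                share = damping * rank[v] / n
                for t in nodes:
                    new_rank[t] += share
        rank = new_rank
    return rank


# Hypothetical community: three subtopic articles all link to the topic article,
# which links back to each of them.
community = {
    "Graph theory": ["Vertex (graph theory)", "Edge (graph theory)", "Graph coloring"],
    "Vertex (graph theory)": ["Graph theory"],
    "Edge (graph theory)": ["Graph theory"],
    "Graph coloring": ["Graph theory"],
}

scores = pagerank(community)
top_article = max(scores, key=scores.get)
```

In this toy graph the article that every other article links to, "Graph theory", ends up with the highest score, which is the pattern the excerpt describes. Real Wikipedia communities are far larger and would first have to be found with a community-detection step that this sketch does not cover.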
November 3, 2009
-
Last weekend I wrote about how the big social gaming companies are making hundreds of millions of dollars in revenue on Facebook and MySpace through games like Farmville and Mobsters. Users are tricked into these lead gen scams.
-
Michael Arrington posted over the weekend about CPA offers within social games and questioned why Facebook, MySpace, Zynga and others would expose these to our users. He raises good points about ‘scammy’ advertisers and the bad user experience they create. I agree with him and others that some of these offers misrepresent and hurt our industry.
October 14, 2009
-
This book describes the important ideas in data mining, machine learning, and bioinformatics in a common conceptual framework. While the approach is statistical, the emphasis is on concepts rather than mathematics. The book's coverage is broad, from supervised learning (prediction) to unsupervised learning. The many topics include neural networks, support vector machines, classification trees, and boosting–the first comprehensive treatment of this topic in any book.
-
A feed fetching and parsing library that treats the internet like Godzilla treats Japan: it dominates and eats all. Feedzirra is a feed library that is designed to get and update many feeds as quickly as possible. This includes using libcurl-multi through the taf2-curb gem for faster http gets, and libxml through nokogiri and sax-machine for faster parsing.
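The speed argument for SAX-style parsing can be illustrated with a small sketch: a streaming handler reads the feed once and keeps only the fields it cares about, without building a full document tree. This is Python's standard `xml.sax` module, not Feedzirra's Ruby API; the class name `TitleHandler`, the helper `item_titles`, and the sample feed are hypothetical.

```python
import xml.sax


class TitleHandler(xml.sax.ContentHandler):
    """Collects the <title> text of every <item> while the feed streams past."""

    def __init__(self):
        super().__init__()
        self.titles = []
        self._in_item = False
        self._in_title = False
        self._buffer = []

    def startElement(self, name, attrs):
        if name == "item":
            self._in_item = True
        elif name == "title" and self._in_item:
            # Only titles inside an <item> count; the channel title is skipped.
            self._in_title = True
            self._buffer = []

    def characters(self, content):
        # Text may arrive in several chunks, so accumulate it.
        if self._in_title:
            self._buffer.append(content)

    def endElement(self, name):
        if name == "title" and self._in_title:
            self.titles.append("".join(self._buffer).strip())
            self._in_title = False
        elif name == "item":
            self._in_item = False


def item_titles(rss_text):
    """Return the item titles found in an RSS document given as a string."""
    handler = TitleHandler()
    xml.sax.parseString(rss_text.encode("utf-8"), handler)
    return handler.titles


# Made-up sample feed, used only to exercise the handler.
SAMPLE_FEED = """<rss version="2.0">
  <channel>
    <title>Example feed</title>
    <item><title>First post</title></item>
    <item><title>Second post</title></item>
  </channel>
</rss>"""

titles = item_titles(SAMPLE_FEED)
```

Fetching many feeds in parallel, which the excerpt attributes to libcurl-multi, is a separate concern and is left out of this sketch.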
October 6, 2009
-
The goal of the django-calais project is to help manage the complexity involved in retrieving, storing, and processing Calais results for your Django models. Essentially it lets you submit any Django model to the Calais service for analysis, then automatically parses the results and stores them in a set of semantic models.
-
Django-Supertagging is an automated tagging application. It is based on Django-Tagging and uses Open-Calais to retrieve the data. Visit the wiki for more information.
-
Our new custom RSS tool is intended for all Times readers — not just developers. It provides a simple way to query the Times Article Search API and a standard way to consume the results. The options for creating a feed are intentionally limited — there’s no way to create a feed for one term OR another, for instance, only combinations of terms — in order to keep the application simple and approachable.