<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments for StreamHacker</title>
	<atom:link href="http://streamhacker.com/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://streamhacker.com</link>
	<description>Weotta be Hacking</description>
	<lastBuildDate>Thu, 19 Apr 2012 12:53:00 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
<atom:link rel="hub" href="http://pubsubhubbub.appspot.com" />
	<atom:link rel="hub" href="http://superfeedr.com/hubbub" />
		<item>
		<title>Comment on Execnet vs Disco for Distributed NLTK by Hadoop MapReduce</title>
		<link>http://streamhacker.com/2009/12/14/execnet-disco-distributed-nltk/comment-page-1/#comment-951</link>
		<dc:creator>Hadoop MapReduce</dc:creator>
		<pubDate>Thu, 19 Apr 2012 12:53:00 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.com/?p=765#comment-951</guid>
		<description>Nice Post dude.. Keep it up.</description>
		<content:encoded><![CDATA[<p>Nice Post dude.. Keep it up.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Part of Speech Tagging with NLTK Part 1 &#8211; Ngram Taggers by VasilisG</title>
		<link>http://streamhacker.com/2008/11/03/part-of-speech-tagging-with-nltk-part-1/comment-page-1/#comment-949</link>
		<dc:creator>VasilisG</dc:creator>
		<pubDate>Sun, 08 Apr 2012 19:55:00 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.wordpress.com/?p=3#comment-949</guid>
		<description>Hi Jacob, when I try to create the ubt I receive the following error:
 
Traceback (most recent call last):
  File &quot;&quot;, line 1, in 
    ubt_tagger = backoff_tagger(train_sents, [nltk.tag.UnigramTagger, nltk.tag.BigramTagger, nltk.tag.TrigramTagger])
  File &quot;&quot;, line 3, in backoff_tagger
    backoff = tagger_classes[0](tagged_sents)
  File &quot;/usr/local/lib/python2.6/dist-packages/nltk/tag/sequential.py&quot;, line 317, in __init__
    backoff, cutoff, verbose)
  File &quot;/usr/local/lib/python2.6/dist-packages/nltk/tag/sequential.py&quot;, line 274, in __init__
    self._train(train, cutoff, verbose)
  File &quot;/usr/local/lib/python2.6/dist-packages/nltk/tag/sequential.py&quot;, line 177, in _train
    tokens, tags = zip(*sentence)
ValueError: need more than 1 value to unpack

Is it because some change in NLTK&#039;s API?

Thanks,
Vasilis</description>
		<content:encoded><![CDATA[<p>Hi Jacob, when I try to create the ubt I receive the following error:<br />
 <br />
Traceback (most recent call last):<br />
  File &#8220;&#8221;, line 1, in<br />
    ubt_tagger = backoff_tagger(train_sents, [nltk.tag.UnigramTagger, nltk.tag.BigramTagger, nltk.tag.TrigramTagger])<br />
  File &#8220;&#8221;, line 3, in backoff_tagger<br />
    backoff = tagger_classes[0](tagged_sents)<br />
  File &#8220;/usr/local/lib/python2.6/dist-packages/nltk/tag/sequential.py&#8221;, line 317, in __init__<br />
    backoff, cutoff, verbose)<br />
  File &#8220;/usr/local/lib/python2.6/dist-packages/nltk/tag/sequential.py&#8221;, line 274, in __init__<br />
    self._train(train, cutoff, verbose)<br />
  File &#8220;/usr/local/lib/python2.6/dist-packages/nltk/tag/sequential.py&#8221;, line 177, in _train<br />
    tokens, tags = zip(*sentence)<br />
ValueError: need more than 1 value to unpack</p>
<p>Is it because some change in NLTK&#8217;s API?</p>
<p>Thanks,<br />
Vasilis</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Mnesia Records to MongoDB Documents by Jacob Perkins</title>
		<link>http://streamhacker.com/2010/02/01/mnesia-records-mongodb-documents/comment-page-1/#comment-948</link>
		<dc:creator>Jacob Perkins</dc:creator>
		<pubDate>Tue, 27 Mar 2012 22:34:00 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.com/?p=916#comment-948</guid>
		<description>The idea with plists is to be a drop-in replacement for functions in lists, with the option of customizing the parallelization by specifying a Malt. ptable then expands on that, but instead of providing a list, you give a mnesia table name and match specification. So for the ptable:foreach functions, F is the function to call on each record in mnesia, Table and Spec are passed thru to mnesia:select in order to select the objects, and Malt is passed to plists:foreach along with F and a list of objects from mnesia. I don&#039;t have an example handy since I haven&#039;t used this code in years, so I recommend reading up on plists &amp; Malt, and mnesia match specifications.</description>
		<content:encoded><![CDATA[<p>The idea with plists is to be a drop-in replacement for functions in lists, with the option of customizing the parallelization by specifying a Malt. ptable then expands on that, but instead of providing a list, you give a mnesia table name and match specification. So for the ptable:foreach functions, F is the function to call on each record in mnesia, Table and Spec are passed thru to mnesia:select in order to select the objects, and Malt is passed to plists:foreach along with F and a list of objects from mnesia. I don&#8217;t have an example handy since I haven&#8217;t used this code in years, so I recommend reading up on plists &amp; Malt, and mnesia match specifications.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Mnesia Records to MongoDB Documents by DG</title>
		<link>http://streamhacker.com/2010/02/01/mnesia-records-mongodb-documents/comment-page-1/#comment-947</link>
		<dc:creator>DG</dc:creator>
		<pubDate>Tue, 27 Mar 2012 20:17:00 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.com/?p=916#comment-947</guid>
		<description>Hi Jacob,
I&#039;m a newbie to erlang (but quite solid in C and DBMS like sqlite, postgres, db2) and trying to use mnesia for a project at university. I found really interesting your comments here and more interesting your elib code... the problem is I&#039;m not able to figure out how to use it :( 
My intentions would be to have parallel processes (read, math functions, write) on data from one or more tables (possibly partitioning on several nodes). I&#039;ve seen you implemented ptable in elib using plist (by you and Stephen Marsh)...

I searched a while for someone using it but seems difficult, surely i&#039;m missing something...
Could you post an (even extremely) simple example of the combination ptable-mnesia?</description>
		<content:encoded><![CDATA[<p>Hi Jacob,<br />
I&#8217;m a newbie to erlang (but quite solid in C and DBMS like sqlite, postgres, db2) and trying to use mnesia for a project at university. I found really interesting your comments here and more interesting your elib code&#8230; the problem is I&#8217;m not able to figure out how to use it <img src='http://streamhacker.com/wp-includes/images/smilies/icon_sad.gif' alt=':(' class='wp-smiley' />  <br />
My intentions would be to have parallel processes (read, math functions, write) on data from one or more tables (possibly partitioning on several nodes). I&#8217;ve seen you implemented ptable in elib using plist (by you and Stephen Marsh)&#8230;</p>
<p>I searched a while for someone using it but seems difficult, surely i&#8217;m missing something&#8230;<br />
Could you post an (even extremely) simple example of the combination ptable-mnesia?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Text Classification for Sentiment Analysis &#8211; Naive Bayes Classifier by Hiral</title>
		<link>http://streamhacker.com/2010/05/10/text-classification-sentiment-analysis-naive-bayes-classifier/comment-page-1/#comment-946</link>
		<dc:creator>Hiral</dc:creator>
		<pubDate>Fri, 09 Mar 2012 14:19:00 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.com/?p=1180#comment-946</guid>
		<description> hi i , i am working on sentiment analysis  same positive and negative i want to use 
naive bayes but i dnt know technique plz guide me from basic thank you </description>
		<content:encoded><![CDATA[<p> hi i , i am working on sentiment analysis  same positive and negative i want to use<br />
naive bayes but i dnt know technique plz guide me from basic thank you</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Text Classification for Sentiment Analysis &#8211; Stopwords and Collocations by Jacob Perkins</title>
		<link>http://streamhacker.com/2010/05/24/text-classification-sentiment-analysis-stopwords-collocations/comment-page-1/#comment-944</link>
		<dc:creator>Jacob Perkins</dc:creator>
		<pubDate>Wed, 08 Feb 2012 15:44:00 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.com/?p=1227#comment-944</guid>
		<description>words is not explicitly defined above, but it&#039;s a function parameter that is expected to be a list of strings. featx is also a function parameter, but it&#039;s expected to be a function that accepts words and returns a dict. This way, you can pass different featx functions to evaluate_classifier to see the different results.</description>
		<content:encoded><![CDATA[<p>words is not explicitly defined above, but it&#8217;s a function parameter that is expected to be a list of strings. featx is also a function parameter, but it&#8217;s expected to be a function that accepts words and returns a dict. This way, you can pass different featx functions to evaluate_classifier to see the different results.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Text Classification for Sentiment Analysis &#8211; Stopwords and Collocations by Fredrik</title>
		<link>http://streamhacker.com/2010/05/24/text-classification-sentiment-analysis-stopwords-collocations/comment-page-1/#comment-943</link>
		<dc:creator>Fredrik</dc:creator>
		<pubDate>Wed, 08 Feb 2012 14:37:00 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.com/?p=1227#comment-943</guid>
		<description>I am quite new to Python, and some parts of the code seems more or less magic to me... I have understood that functions are just ordinary objects/values in Python and I guess that this is the trick, but can you explain or suggest a good link for explaining how the following parts of the code work? The name word_feats seems to be bounded to the function word_feats, but what is words bound too? I guess it is bound to featx through function evaluate_classifier, but I really don&#039;t get how featx is assigned a value in 
negfeats = [(featx(movie_reviews.words(fileids=[f])), &#039;neg&#039;) for f in negids] (to me it looks like featx is a function here, but I guess it is not? I guess that I should do some basic reading about Python, but any clarification would be helpful.

</description>
		<content:encoded><![CDATA[<p>I am quite new to Python, and some parts of the code seems more or less magic to me&#8230; I have understood that functions are just ordinary objects/values in Python and I guess that this is the trick, but can you explain or suggest a good link for explaining how the following parts of the code work? The name word_feats seems to be bounded to the function word_feats, but what is words bound too? I guess it is bound to featx through function evaluate_classifier, but I really don&#8217;t get how featx is assigned a value in <br />
negfeats = [(featx(movie_reviews.words(fileids=[f])), &#8216;neg&#8217;) for f in negids] (to me it looks like featx is a function here, but I guess it is not? I guess that I should do some basic reading about Python, but any clarification would be helpful.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Text Classification for Sentiment Analysis &#8211; Naive Bayes Classifier by &#187; A Text Analysis of Supreme Court Oral Arguments jarv.org</title>
		<link>http://streamhacker.com/2010/05/10/text-classification-sentiment-analysis-naive-bayes-classifier/comment-page-1/#comment-942</link>
		<dc:creator>&#187; A Text Analysis of Supreme Court Oral Arguments jarv.org</dc:creator>
		<pubDate>Sun, 05 Feb 2012 22:47:34 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.com/?p=1180#comment-942</guid>
		<description>[...] For more information about sentiment analysis there is some good information here and in these two articles. Applying this to oral arguments? Well let&#8217;s leave it as just one way to look at [...]</description>
		<content:encoded><![CDATA[<p>[...] For more information about sentiment analysis there is some good information here and in these two articles. Applying this to oral arguments? Well let&#8217;s leave it as just one way to look at [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Fuzzy String Matching in Python by Jacob Perkins</title>
		<link>http://streamhacker.com/2011/10/31/fuzzy-string-matching-python/comment-page-1/#comment-940</link>
		<dc:creator>Jacob Perkins</dc:creator>
		<pubDate>Sun, 25 Dec 2011 15:55:00 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.com/?p=1815#comment-940</guid>
		<description>I&#039;ve never heard of anyone doing that, and I&#039;m not sure how well it would work, because you can&#039;t necessarily break a string down into pieces and match those pieces to match the whole string. For larger strings that might make sense, but the smaller the string, the more exact you should be.</description>
		<content:encoded><![CDATA[<p>I&#8217;ve never heard of anyone doing that, and I&#8217;m not sure how well it would work, because you can&#8217;t necessarily break a string down into pieces and match those pieces to match the whole string. For larger strings that might make sense, but the smaller the string, the more exact you should be.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Fuzzy String Matching in Python by Christopher Stoll</title>
		<link>http://streamhacker.com/2011/10/31/fuzzy-string-matching-python/comment-page-1/#comment-939</link>
		<dc:creator>Christopher Stoll</dc:creator>
		<pubDate>Sun, 25 Dec 2011 05:46:00 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.com/?p=1815#comment-939</guid>
		<description>You might be able to replace these techniques (normalization, regex, etc.) with a single dynamic programming algorithm, no? I guess it probably depends upon what you are trying to accomplish though.</description>
		<content:encoded><![CDATA[<p>You might be able to replace these techniques (normalization, regex, etc.) with a single dynamic programming algorithm, no? I guess it probably depends upon what you are trying to accomplish though.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

