<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Chunk Extraction with NLTK</title>
	<atom:link href="http://streamhacker.com/2009/02/23/chunk-extraction-with-nltk/feed/" rel="self" type="application/rss+xml" />
	<link>http://streamhacker.com/2009/02/23/chunk-extraction-with-nltk/#utm_source=feed&#038;utm_medium=feed&#038;utm_campaign=feed</link>
	<description>Weotta be Hacking</description>
	<lastBuildDate>Sun, 05 Feb 2012 22:47:34 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
<atom:link rel="hub" href="http://pubsubhubbub.appspot.com" />
	<atom:link rel="hub" href="http://superfeedr.com/hubbub" />
		<item>
		<title>By: Graduation project &#124; david.graus</title>
		<link>http://streamhacker.com/2009/02/23/chunk-extraction-with-nltk/comment-page-1/#comment-894</link>
		<dc:creator>Graduation project &#124; david.graus</dc:creator>
		<pubDate>Wed, 31 Aug 2011 12:53:32 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.wordpress.com/?p=164#comment-894</guid>
		<description>[...] Possibly looking at keyphrase extraction instead of simple keywords [or maybe looking at word collocations/chunk extraction?].  [...]</description>
		<content:encoded><![CDATA[<p>[...] Possibly looking at keyphrase extraction instead of simple keywords [or maybe looking at word collocations/chunk extraction?].  [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: dvdgrs » Graduation project</title>
		<link>http://streamhacker.com/2009/02/23/chunk-extraction-with-nltk/comment-page-1/#comment-855</link>
		<dc:creator>dvdgrs » Graduation project</dc:creator>
		<pubDate>Mon, 11 Jul 2011 14:12:04 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.wordpress.com/?p=164#comment-855</guid>
		<description>[...] Possibly looking at keyphrase extraction instead of simple keywords [or maybe looking at word collocations/chunk extraction?].  [...]</description>
		<content:encoded><![CDATA[<p>[...] Possibly looking at keyphrase extraction instead of simple keywords [or maybe looking at word collocations/chunk extraction?].  [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jacob Perkins</title>
		<link>http://streamhacker.com/2009/02/23/chunk-extraction-with-nltk/comment-page-1/#comment-826</link>
		<dc:creator>Jacob Perkins</dc:creator>
		<pubDate>Wed, 04 May 2011 00:12:00 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.wordpress.com/?p=164#comment-826</guid>
		<description>you missed a closing parens for join, should be &#039;&#039;.join([w, t, c])</description>
		<content:encoded><![CDATA[<p>you missed a closing parens for join, should be &#8221;.join([w, t, c])</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nk</title>
		<link>http://streamhacker.com/2009/02/23/chunk-extraction-with-nltk/comment-page-1/#comment-825</link>
		<dc:creator>Nk</dc:creator>
		<pubDate>Wed, 04 May 2011 00:05:00 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.wordpress.com/?p=164#comment-825</guid>
		<description>Your code doesn&#039;t seem to work, there is a syntax error a line 16 &quot;lines = [&#039; &#039;.join([w, t, c] for (w, (t, c)) in wtc if c]&quot;</description>
		<content:encoded><![CDATA[<p>Your code doesn&#8217;t seem to work, there is a syntax error a line 16 &#8220;lines = [' '.join([w, t, c] for (w, (t, c)) in wtc if c]&#8221;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: What&#039;s the best way to extract phrases from a corpus of text using Python? - Quora</title>
		<link>http://streamhacker.com/2009/02/23/chunk-extraction-with-nltk/comment-page-1/#comment-722</link>
		<dc:creator>What&#039;s the best way to extract phrases from a corpus of text using Python? - Quora</dc:creator>
		<pubDate>Tue, 09 Nov 2010 14:52:19 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.wordpress.com/?p=164#comment-722</guid>
		<description>[...] date here&#160;&#160;Vineet Yadav, M.Tech Computational linguistics , II... Train nltk chunker(http://streamhacker.com/2009/02/...)Like NLTK, Montilingua( http://web.media.mit.edu/~hugo/m...) is end to end NLP framework, it uses [...]</description>
		<content:encoded><![CDATA[<p>[...] date here&nbsp;&nbsp;Vineet Yadav, M.Tech Computational linguistics , II&#8230; Train nltk chunker(http://streamhacker.com/2009/02/&#8230;)Like NLTK, Montilingua( <a href="http://web.media.mit.edu/~hugo/m.." rel="nofollow">http://web.media.mit.edu/~hugo/m..</a>.) is end to end NLP framework, it uses [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Learning to do natural language processing with NLTK &#124; JetLlib Journal</title>
		<link>http://streamhacker.com/2009/02/23/chunk-extraction-with-nltk/comment-page-1/#comment-542</link>
		<dc:creator>Learning to do natural language processing with NLTK &#124; JetLlib Journal</dc:creator>
		<pubDate>Sun, 04 Apr 2010 22:13:12 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.wordpress.com/?p=164#comment-542</guid>
		<description>[...] Chunk Extraction with NLTK [...]</description>
		<content:encoded><![CDATA[<p>[...] Chunk Extraction with NLTK [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jacob</title>
		<link>http://streamhacker.com/2009/02/23/chunk-extraction-with-nltk/comment-page-1/#comment-31</link>
		<dc:creator>Jacob</dc:creator>
		<pubDate>Tue, 30 Jun 2009 15:10:01 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.wordpress.com/?p=164#comment-31</guid>
		<description>Patrick, thanks for catching the typo. I&#039;ve updated the article with the correct code.</description>
		<content:encoded><![CDATA[<p>Patrick, thanks for catching the typo. I&#8217;ve updated the article with the correct code.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Patrick</title>
		<link>http://streamhacker.com/2009/02/23/chunk-extraction-with-nltk/comment-page-1/#comment-30</link>
		<dc:creator>Patrick</dc:creator>
		<pubDate>Tue, 30 Jun 2009 08:47:41 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.wordpress.com/?p=164#comment-30</guid>
		<description>Nice article.

Slight typo (missing a closing parenthesis after c]) in line #16
16.        lines = [&#039; &#039;.join([w, t, c] for (w, (t, c)) in wtc if c]
to
16.        lines = [&#039; &#039;.join([w, t, c]) for (w, (t, c)) in wtc if c]</description>
		<content:encoded><![CDATA[<p>Nice article.</p>
<p>Slight typo (missing a closing parenthesis after c]) in line #16<br />
16.        lines = [' '.join([w, t, c] for (w, (t, c)) in wtc if c]<br />
to<br />
16.        lines = [' '.join([w, t, c]) for (w, (t, c)) in wtc if c]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: willdampier</title>
		<link>http://streamhacker.com/2009/02/23/chunk-extraction-with-nltk/comment-page-1/#comment-23</link>
		<dc:creator>willdampier</dc:creator>
		<pubDate>Thu, 26 Feb 2009 05:12:47 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.wordpress.com/?p=164#comment-23</guid>
		<description>I&#039;m trying to use the results of the pos tagger (and now chunker) to pull out words and phrases to use as features in a classification problem as I talk about in http://megamicrobase.wordpress.com/2009/02/26/featuring-the-featureset/ ... since I&#039;m dealing with mostly biomedical annotations I think I&#039;ll gain a lot of specificity by switching to a medical corpus for the pos tagging and chunk parser.</description>
		<content:encoded><![CDATA[<p>I&#8217;m trying to use the results of the pos tagger (and now chunker) to pull out words and phrases to use as features in a classification problem as I talk about in <a href="http://megamicrobase.wordpress.com/2009/02/26/featuring-the-featureset/" rel="nofollow">http://megamicrobase.wordpress.com/2009/02/26/featuring-the-featureset/</a> &#8230; since I&#8217;m dealing with mostly biomedical annotations I think I&#8217;ll gain a lot of specificity by switching to a medical corpus for the pos tagging and chunk parser.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jacob</title>
		<link>http://streamhacker.com/2009/02/23/chunk-extraction-with-nltk/comment-page-1/#comment-22</link>
		<dc:creator>Jacob</dc:creator>
		<pubDate>Thu, 26 Feb 2009 03:09:04 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.wordpress.com/?p=164#comment-22</guid>
		<description>If you know the exact patterns you&#039;re looking for, you can also use the &lt;a href=&quot;http://nltk.googlecode.com/svn/trunk/doc/api/nltk.chunk.regexp.RegexpParser-class.html&quot; rel=&quot;nofollow&quot;&gt;RegexpParser&lt;/a&gt;. It&#039;ll be a lot more accurate than itertools.groupby :)</description>
		<content:encoded><![CDATA[<p>If you know the exact patterns you&#8217;re looking for, you can also use the <a href="http://nltk.googlecode.com/svn/trunk/doc/api/nltk.chunk.regexp.RegexpParser-class.html" rel="nofollow">RegexpParser</a>. It&#8217;ll be a lot more accurate than itertools.groupby <img src='http://streamhacker.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
</channel>
</rss>

