<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: NLTK Classifier Based Chunker Accuracy</title>
	<atom:link href="http://streamhacker.com/2010/03/15/nltk-classifier-based-chunker-accuracy/feed/" rel="self" type="application/rss+xml" />
	<link>http://streamhacker.com/2010/03/15/nltk-classifier-based-chunker-accuracy/#utm_source=feed&#038;utm_medium=feed&#038;utm_campaign=feed</link>
	<description>Weotta be Hacking</description>
	<lastBuildDate>Thu, 19 Apr 2012 12:53:00 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
<atom:link rel="hub" href="http://pubsubhubbub.appspot.com" />
	<atom:link rel="hub" href="http://superfeedr.com/hubbub" />
		<item>
		<title>By: Learning to do natural language processing with NLTK &#124; JetLlib Journal</title>
		<link>http://streamhacker.com/2010/03/15/nltk-classifier-based-chunker-accuracy/comment-page-1/#comment-539</link>
		<dc:creator>Learning to do natural language processing with NLTK &#124; JetLlib Journal</dc:creator>
		<pubDate>Sat, 03 Apr 2010 20:12:05 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.com/?p=1027#comment-539</guid>
		<description>[...] NLTK Classifier Based Chunker Accuracy [...]</description>
		<content:encoded><![CDATA[<p>[...] NLTK Classifier Based Chunker Accuracy [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jacob Perkins</title>
		<link>http://streamhacker.com/2010/03/15/nltk-classifier-based-chunker-accuracy/comment-page-1/#comment-603</link>
		<dc:creator>Jacob Perkins</dc:creator>
		<pubDate>Tue, 30 Mar 2010 04:10:37 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.com/?p=1027#comment-603</guid>
		<description>Sorry for the late reply, somehow the notification got caught in my spam filter. Anyway...&lt;br&gt;&lt;br&gt;If you think the iterations are cutting off too quickly, you might want to try tweaking some of the cutoffs described here: &lt;a href=&quot;http://nltk.googlecode.com/svn/trunk/doc/api/nltk.classify.maxent.MaxentClassifier-class.html#train&quot; rel=&quot;nofollow&quot;&gt;http://nltk.googlecode.com/svn/trunk/doc/api/nl...&lt;/a&gt;&lt;br&gt;&lt;br&gt;Wish I could help with the NE. All I can think of is to suggest checking out the names corpus: &lt;a href=&quot;http://nltk.googlecode.com/svn/trunk/doc/api/nltk.corpus-module.html#names&quot; rel=&quot;nofollow&quot;&gt;http://nltk.googlecode.com/svn/trunk/doc/api/nl...&lt;/a&gt;&lt;br&gt;Maybe you can train on that somehow.</description>
		<content:encoded><![CDATA[<p>Sorry for the late reply, somehow the notification got caught in my spam filter. Anyway&#8230;</p>
<p>If you think the iterations are cutting off too quickly, you might want to try tweaking some of the cutoffs described here: <a href="http://nltk.googlecode.com/svn/trunk/doc/api/nltk.classify.maxent.MaxentClassifier-class.html#train" rel="nofollow">http://nltk.googlecode.com/svn/trunk/doc/api/nl&#8230;</a></p>
<p>Wish I could help with the NE. All I can think of is to suggest checking out the names corpus: <a href="http://nltk.googlecode.com/svn/trunk/doc/api/nltk.corpus-module.html#names" rel="nofollow">http://nltk.googlecode.com/svn/trunk/doc/api/nl&#8230;</a><br />Maybe you can train on that somehow.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jacob Perkins</title>
		<link>http://streamhacker.com/2010/03/15/nltk-classifier-based-chunker-accuracy/comment-page-1/#comment-537</link>
		<dc:creator>Jacob Perkins</dc:creator>
		<pubDate>Mon, 29 Mar 2010 21:10:37 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.com/?p=1027#comment-537</guid>
		<description>Sorry for the late reply, somehow the notification got caught in my spam filter. Anyway...&lt;br&gt;&lt;br&gt;If you think the iterations are cutting off too quickly, you might want to try tweaking some of the cutoffs described here: &lt;a href=&quot;http://nltk.googlecode.com/svn/trunk/doc/api/nltk.classify.maxent.MaxentClassifier-class.html#train&quot; rel=&quot;nofollow&quot;&gt;http://nltk.googlecode.com/svn/trunk/doc/api/nl...&lt;/a&gt;&lt;br&gt;&lt;br&gt;Wish I could help with the NE. All I can think of is to suggest checking out the names corpus: &lt;a href=&quot;http://nltk.googlecode.com/svn/trunk/doc/api/nltk.corpus-module.html#names&quot; rel=&quot;nofollow&quot;&gt;http://nltk.googlecode.com/svn/trunk/doc/api/nl...&lt;/a&gt;&lt;br&gt;Maybe you can train on that somehow.</description>
		<content:encoded><![CDATA[<p>Sorry for the late reply, somehow the notification got caught in my spam filter. Anyway&#8230;</p>
<p>If you think the iterations are cutting off too quickly, you might want to try tweaking some of the cutoffs described here: <a href="http://nltk.googlecode.com/svn/trunk/doc/api/nltk.classify.maxent.MaxentClassifier-class.html#train" rel="nofollow">http://nltk.googlecode.com/svn/trunk/doc/api/nl&#8230;</a></p>
<p>Wish I could help with the NE. All I can think of is to suggest checking out the names corpus: <a href="http://nltk.googlecode.com/svn/trunk/doc/api/nltk.corpus-module.html#names" rel="nofollow">http://nltk.googlecode.com/svn/trunk/doc/api/nl&#8230;</a><br />Maybe you can train on that somehow.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: James Smith</title>
		<link>http://streamhacker.com/2010/03/15/nltk-classifier-based-chunker-accuracy/comment-page-1/#comment-534</link>
		<dc:creator>James Smith</dc:creator>
		<pubDate>Fri, 26 Mar 2010 16:00:57 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.com/?p=1027#comment-534</guid>
		<description>I&#039;ve got my computer finally setup so I&#039;m training the stuff myself now.&lt;br&gt;&lt;br&gt;I&#039;m training a few different ones to evaluate but I suspect I may be doing something wrong as the iterations go 1, 2, Final. Any ideas?&lt;br&gt;&lt;br&gt;Unlike yourself, I&#039;m also interested in the named entity side of this. Unfortunately the NLTK default chunker is pretty bad at recognising the entity types so I&#039;m using conll2002s chunked_sents as a training corpus. I&#039;d also like to use the treebank NE material but suspect I would have to change the tags to the same names as those used in conll materials.</description>
		<content:encoded><![CDATA[<p>I&#39;ve got my computer finally setup so I&#39;m training the stuff myself now.</p>
<p>I&#39;m training a few different ones to evaluate but I suspect I may be doing something wrong as the iterations go 1, 2, Final. Any ideas?</p>
<p>Unlike yourself, I&#39;m also interested in the named entity side of this. Unfortunately the NLTK default chunker is pretty bad at recognising the entity types so I&#39;m using conll2002s chunked_sents as a training corpus. I&#39;d also like to use the treebank NE material but suspect I would have to change the tags to the same names as those used in conll materials.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jacob Perkins</title>
		<link>http://streamhacker.com/2010/03/15/nltk-classifier-based-chunker-accuracy/comment-page-1/#comment-533</link>
		<dc:creator>Jacob Perkins</dc:creator>
		<pubDate>Thu, 25 Mar 2010 01:41:23 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.com/?p=1027#comment-533</guid>
		<description>What corpus would you want it trained on, treebank, conll2000, or both? Any suggestions on where to put the file? (it&#039;s a couple megabytes)</description>
		<content:encoded><![CDATA[<p>What corpus would you want it trained on, treebank, conll2000, or both? Any suggestions on where to put the file? (it&#39;s a couple megabytes)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: James Smith</title>
		<link>http://streamhacker.com/2010/03/15/nltk-classifier-based-chunker-accuracy/comment-page-1/#comment-530</link>
		<dc:creator>James Smith</dc:creator>
		<pubDate>Tue, 23 Mar 2010 19:23:15 +0000</pubDate>
		<guid isPermaLink="false">http://streamhacker.com/?p=1027#comment-530</guid>
		<description>I don&#039;t suppose  you&#039;d be willing to upload the trained chunker as a serialized object would you?</description>
		<content:encoded><![CDATA[<p>I don&#39;t suppose  you&#39;d be willing to upload the trained chunker as a serialized object would you?</p>
]]></content:encoded>
	</item>
</channel>
</rss>

