<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://replica.wiki.extremist.software/index.php?action=history&amp;feed=atom&amp;title=Machine_Learning_Meetup_Notes%3A_2010-05-19</id>
	<title>Machine Learning Meetup Notes: 2010-05-19 - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://replica.wiki.extremist.software/index.php?action=history&amp;feed=atom&amp;title=Machine_Learning_Meetup_Notes%3A_2010-05-19"/>
	<link rel="alternate" type="text/html" href="https://replica.wiki.extremist.software/index.php?title=Machine_Learning_Meetup_Notes:_2010-05-19&amp;action=history"/>
	<updated>2026-04-05T05:43:00Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.39.13</generator>
	<entry>
		<id>https://replica.wiki.extremist.software/index.php?title=Machine_Learning_Meetup_Notes:_2010-05-19&amp;diff=11338&amp;oldid=prev</id>
		<title>SpammerHellDontDelete at 05:04, 24 May 2010</title>
		<link rel="alternate" type="text/html" href="https://replica.wiki.extremist.software/index.php?title=Machine_Learning_Meetup_Notes:_2010-05-19&amp;diff=11338&amp;oldid=prev"/>
		<updated>2010-05-24T05:04:05Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 22:04, 23 May 2010&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l1&quot;&gt;Line 1:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 1:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;*Erin provided a list of unique SubSkills and TracedSkills with frequencies, as well as a python script to normalize the skill values in the challenge sets.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;*Erin provided a list of unique SubSkills and TracedSkills with frequencies, as well as a python script to normalize the skill values in the challenge sets.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;*Vikram gave a presentation on Hadoop, EC2 and MapReduce.  He created a bunch of scripts for EC2 MapReduce.  Those tools can be found on [http://github.com/voberoi/hadoop-mrutils github].&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;*Vikram gave a presentation &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;and demo &lt;/ins&gt;on Hadoop, EC2 and MapReduce.  He created a bunch of scripts for EC2 MapReduce.  Those tools can be found on [http://github.com/voberoi/hadoop-mrutils github].&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Here are some map reduce notes:&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Here are some map reduce notes:&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>SpammerHellDontDelete</name></author>
	</entry>
	<entry>
		<id>https://replica.wiki.extremist.software/index.php?title=Machine_Learning_Meetup_Notes:_2010-05-19&amp;diff=11335&amp;oldid=prev</id>
		<title>SpammerHellDontDelete: Created page with &#039;*Erin provided a list of unique SubSkills and TracedSkills with frequencies, as well as a python script to normalize the skill values in the challenge sets. *Vikram gave a presen…&#039;</title>
		<link rel="alternate" type="text/html" href="https://replica.wiki.extremist.software/index.php?title=Machine_Learning_Meetup_Notes:_2010-05-19&amp;diff=11335&amp;oldid=prev"/>
		<updated>2010-05-24T04:50:16Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;#039;*Erin provided a list of unique SubSkills and TracedSkills with frequencies, as well as a python script to normalize the skill values in the challenge sets. *Vikram gave a presen…&amp;#039;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;*Erin provided a list of unique SubSkills and TracedSkills with frequencies, as well as a python script to normalize the skill values in the challenge sets.&lt;br /&gt;
*Vikram gave a presentation on Hadoop, EC2 and MapReduce.  He created a bunch of scripts for EC2 MapReduce.  Those tools can be found on [http://github.com/voberoi/hadoop-mrutils github].&lt;br /&gt;
&lt;br /&gt;
Here are some map reduce notes:&lt;br /&gt;
&lt;br /&gt;
Word Counts (let line number be the key):&lt;br /&gt;
&lt;br /&gt;
1	hello how are you &lt;br /&gt;
&lt;br /&gt;
2	how is it going&lt;br /&gt;
&lt;br /&gt;
3 	are you happy&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;def map(key, value):&lt;br /&gt;
&lt;br /&gt;
	words = value.split()&lt;br /&gt;
&lt;br /&gt;
	#[&amp;quot;hello&amp;quot;, &amp;quot;how&amp;quot;, &amp;quot;are&amp;quot;, &amp;quot;you&amp;quot;]&lt;br /&gt;
&lt;br /&gt;
	for word in words&lt;br /&gt;
&lt;br /&gt;
		emit(word, 1)&lt;br /&gt;
		&lt;br /&gt;
&lt;br /&gt;
def reduce(key, values):&lt;br /&gt;
&lt;br /&gt;
	emit(key, len(values))	&amp;lt;/pre&amp;gt;&lt;br /&gt;
	&lt;br /&gt;
results:&lt;br /&gt;
&lt;br /&gt;
hello 	[1]&lt;br /&gt;
&lt;br /&gt;
how		[1,1]&lt;br /&gt;
&lt;br /&gt;
are		[1,1]&lt;/div&gt;</summary>
		<author><name>SpammerHellDontDelete</name></author>
	</entry>
</feed>