<?xml version="1.0" encoding="UTF-8"?><!-- generator="bbPress" -->

<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
>

<channel>
<title>FlowingData Forums &#187; Tag: big data - Recent Posts</title>
<link>http://forums.flowingdata.com/</link>
<description>Strength in Numbers</description>
<language>en</language>
<pubDate>Sat, 11 Feb 2012 19:18:57 +0000</pubDate>

<item>
<title>Data Storyteller at Knewton - NYC</title>
<link>http://forums.flowingdata.com/topic/data-storyteller-at-knewton-nyc#post-2404</link>
<pubDate>Fri, 18 Nov 2011 17:08:38 +0000</pubDate>
<dc:creator>dzwieback</dc:creator>
<guid isPermaLink="false">2404@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Knewton is looking for master data storyteller to explore vast and growing quantities of educational data. You'll be joining an elite team that is building the world’s most powerful adaptive learning platform.&#60;/p&#62;
&#60;p&#62;You are obsessive about data visualization, a fearless defender of truth in data, and an experienced exterminator of chartjunk. You are a master or mistress of SQL, Excel, Tableau (and the like). You have some programming experience (python, perl, shell, etc.), and familiarity with statistical methods and tools (e.g., R). Above all, you are intellectually curious.&#60;/p&#62;
&#60;p&#62;Knewton is based in NYC’s Union Square, but you can work from pretty much anywhere. Benefits include Macs with huge monitors, a kitchen filled with your favorite treats, flexible hours, frequent guest speakers, company-sponsored training and conferences, excellent health benefits, equity, and a chance to do meaningful, cutting edge work.&#60;/p&#62;
&#60;p&#62;Apply at &#60;a href=&#34;http://jobs.knewton.com/apply/u4UOuS/Master-Data-Storyteller.html&#34; rel=&#34;nofollow&#34;&#62;http://jobs.knewton.com/apply/u4UOuS/Master-Data-Storyteller.html&#60;/a&#62; .
&#60;/p&#62;</description>
</item>
<item>
<title>Pythonic Software Engineer</title>
<link>http://forums.flowingdata.com/topic/pythonic-software-engineer#post-2338</link>
<pubDate>Tue, 13 Sep 2011 02:46:24 +0000</pubDate>
<dc:creator>josverwoerd</dc:creator>
<guid isPermaLink="false">2338@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Job Description&#60;/p&#62;
&#60;p&#62;We seek a PEP8 Software Engineer with 5+ years experience developing large-scale, distributed applications. BigML's development is done in Corvallis, OR.&#60;/p&#62;
&#60;p&#62;Responsibilities:&#60;/p&#62;
&#60;p&#62;    Develop new highly scalable intelligent applications capable of processing hundreds of terabytes of data.&#60;/p&#62;
&#60;p&#62;Desired Skills &#38;amp; Experience&#60;/p&#62;
&#60;p&#62;Must Have:&#60;/p&#62;
&#60;p&#62;    BS degree in Computer Science.&#60;br /&#62;
    5 years of large-scale software design and development experience, with extensive knowledge of Unix.&#60;br /&#62;
    4 years experience with C/Python and PyPy.&#60;br /&#62;
    Best software engineering practices.&#60;br /&#62;
    Passionate about coding.&#60;br /&#62;
    Passionate about distributed applications.&#60;br /&#62;
    Passionate about machine learning applications.&#60;br /&#62;
    Passionate about solving large-scale problems. &#60;/p&#62;
&#60;p&#62;Nice to have:&#60;/p&#62;
&#60;p&#62;    Experience with MongoDB, replica sets, and sharding.&#60;br /&#62;
    Experience with HBase and Hadoop.&#60;br /&#62;
    Experience with RabbitMQ.&#60;br /&#62;
    Experience with Tornado.&#60;br /&#62;
    Experience with Django.&#60;br /&#62;
    Experience with Nginx.&#60;br /&#62;
    Masters or Ph.D. in Computer Science.&#60;/p&#62;
&#60;p&#62;Company Description&#60;/p&#62;
&#60;p&#62;BigML, Inc is a new start-up that is developing a highly-scalable, easy-to-use machine learning service for big data.  We are seeking world-class software engineers who are passionate about creating the next generation of intelligent services.  If you want to change the world, try helping us predict its future.
&#60;/p&#62;</description>
</item>
<item>
<title>Data-Visualization/UI Lead Developer for Venture-Backed Startup (Quantifind)</title>
<link>http://forums.flowingdata.com/topic/data-visualizationui-lead-developer-for-venture-backed-startup-quantifind#post-1880</link>
<pubDate>Tue, 07 Sep 2010 19:31:37 +0000</pubDate>
<dc:creator>quantifind</dc:creator>
<guid isPermaLink="false">1880@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Title: Data-Visualization/UI Lead Developer for Venture-Backed Startup (Quantifind)&#60;br /&#62;
Company: Quantifind, Inc.&#60;br /&#62;
Location: Palo Alto, CA&#60;/p&#62;
&#60;p&#62;---&#60;br /&#62;
Description:&#60;/p&#62;
&#60;p&#62;Quantifind is a well-funded startup located in Palo Alto, CA, supported by a tier-1 VC and founded by physicists from Stanford University.  We have developed a powerful platform for unstructured-data analytics and are seeking an experienced User Interface virtuoso to unleash its full potential.&#60;/p&#62;
&#60;p&#62;The ideal candidate should be passionate about data visualization and capable of designing compelling, exploratory visualizations of data dynamically extracted from huge unstructured repositories.  Our product has already elicited from early customers the feeling of power that the best big-data applications bring.  The candidate's job will be to combine this processing potential with the sense of front-end magic that the best modern visualization tools provide.  With a combination of programming skills, an intuitive sense of design, and an ability to integrate user feedback, we expect the applicant to create a polished product that resonates with the customer's need to process information efficiently, collectively, and visually.&#60;/p&#62;
&#60;p&#62;We are most interested in seeing a portfolio that demonstrates both originality and an awareness of usability from an applicant, although several years of experience is a bonus.&#60;/p&#62;
&#60;p&#62;This lead developer position comes with a very competitive compensation and equity offer that matches the large impact that the position is expected to provide.&#60;/p&#62;
&#60;p&#62;---&#60;br /&#62;
Recommended skills: &#60;/p&#62;
&#60;p&#62;- UI, UX, HCI&#60;br /&#62;
- Javascript, jQuery&#60;br /&#62;
- HTML5, CSS, Canvas, SVG&#60;br /&#62;
- Web-based plotting/charting visualization libraries&#60;br /&#62;
- R, Processing, Tableau&#60;br /&#62;
- Google Web Toolkit (GWT)&#60;br /&#62;
- Java&#60;br /&#62;
- Grails/Rails&#60;br /&#62;
- Adobe Illustrator, Photoshop&#60;/p&#62;
&#60;p&#62;---&#60;br /&#62;
Experience with...&#60;br /&#62;
…modern web-based data analysis and visualization tools&#60;br /&#62;
…building visual (or standard) Information Retrieval/Search interfaces&#60;br /&#62;
…design; aesthetic principles concerning the communication of quantitative data (Tufte)&#60;br /&#62;
…big-data science and general methods for large-scale statistical processing of scientific data&#60;br /&#62;
…user testing, both direct and tracking&#60;/p&#62;
&#60;p&#62;---&#60;/p&#62;
&#60;p&#62;Quantifind welcomes applicants of all origins and will sponsor H1B candidates.&#60;/p&#62;
&#60;p&#62;---&#60;br /&#62;
Contact:  &#60;/p&#62;
&#60;p&#62;Please email &#60;a href=&#34;mailto:careers@quantifind.com&#34; rel=&#34;nofollow&#34;&#62;careers@quantifind.com&#60;/a&#62; with a resume, portfolio, or questions regarding this opening.&#60;/p&#62;
&#60;p&#62;(Resumes using Comic Sans should be sent to &#60;a href=&#34;mailto:omgwtf@quantifind.com&#34; rel=&#34;nofollow&#34;&#62;omgwtf@quantifind.com&#60;/a&#62; for expedited processing.)
&#60;/p&#62;</description>
</item>
<item>
<title>MachetEC2: Open Visualization / Big Data Toolkit on Amazon EC2</title>
<link>http://forums.flowingdata.com/topic/machetec2-open-visualization-big-data-toolkit-on-amazon-ec2#post-686</link>
<pubDate>Wed, 06 May 2009 04:01:01 +0000</pubDate>
<dc:creator>java-nokia</dc:creator>
<guid isPermaLink="false">686@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Hello, Very Nice forum!&#60;br /&#62;
&#60;a href=&#34;http://xn--90aeflbqelf1a1jeg.com/&#34; title=&#34;Java Nokia&#34; rel=&#34;nofollow&#34;&#62;Java for Nokia&#60;/a&#62;
&#60;/p&#62;</description>
</item>
<item>
<title>MachetEC2: Open Visualization / Big Data Toolkit on Amazon EC2</title>
<link>http://forums.flowingdata.com/topic/machetec2-open-visualization-big-data-toolkit-on-amazon-ec2#post-386</link>
<pubDate>Fri, 06 Feb 2009 17:44:55 +0000</pubDate>
<dc:creator>dhruv</dc:creator>
<guid isPermaLink="false">386@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;machetEC2 is released!  Check out our &#60;a href=&#34;http://machetec2.org&#34; rel=&#34;nofollow&#34;&#62;blog post&#60;/a&#62; (soon to be wiki) for more details.  The id of the image is ami-29ef0840.  Launch it and try it out!
&#60;/p&#62;</description>
</item>
<item>
<title>Digging Into Data Challenege</title>
<link>http://forums.flowingdata.com/topic/digging-into-data-challenege#post-382</link>
<pubDate>Thu, 05 Feb 2009 17:08:30 +0000</pubDate>
<dc:creator>cloister</dc:creator>
<guid isPermaLink="false">382@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;It means that research changes focus.&#60;/p&#62;
&#60;p&#62;The old focus was on discovering new facts.  The new focus is on discovering &#60;strong&#62;patterns of facts&#60;/strong&#62;.&#60;/p&#62;
&#60;p&#62;This is particularly notable in archaeology.  If ever there was a science whose public image is on the literal discovery of new facts--in this case, arti&#34;facts&#34; buried within historical sites--archaeology would have to be it.&#60;/p&#62;
&#60;p&#62;But in the past decade or so, archaeologists have begun to put all their data into databases and to data-mine them.  In the American Southwest, for example, there are now databases that are composed of the ages, locations, and many other parameters of thousands of dig sites and millions of objects from the American Indian cultures that previously lived there.&#60;/p&#62;
&#60;p&#62;And data mining is starting to provide answers to questions that were previously unapproachable.  Who were these people?  What were their geographical ranges?  How did they trade goods between each other, often across vast distances, on foot?  How did one society's art and technology affect its neighbors?&#60;/p&#62;
&#60;p&#62;By looking at patterns in the data, such as projectile points vs. knapping style vs. time, researchers can get a picture of where a particular technology originated, when it originated, and how it spread across the Southwest and Central America.&#60;/p&#62;
&#60;p&#62;Yes, discovering a new dig site can be exciting and revolutionary.  But in another sense, discovering yet another dig site is kind of ho-hum.  After you've already got a few million of them catalogued, digging up another few hundred arrowheads or broken pots or whatever becomes kind of meaningless.  It's only when you can put that new data into context with all the rest and look at the patterns that emerge, can the data take on any real meaning.
&#60;/p&#62;</description>
</item>
<item>
<title>Digging Into Data Challenege</title>
<link>http://forums.flowingdata.com/topic/digging-into-data-challenege#post-381</link>
<pubDate>Thu, 05 Feb 2009 00:41:40 +0000</pubDate>
<dc:creator>nathany</dc:creator>
<guid isPermaLink="false">381@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;&#60;a href=&#34;http://www.diggingintodata.org/&#34; rel=&#34;nofollow&#34;&#62;http://www.diggingintodata.org/&#60;/a&#62; [via &#60;a href=&#34;http://delicious.com/keyvowel&#34; rel=&#34;nofollow&#34;&#62;keyvowel&#60;/a&#62;]&#60;/p&#62;
&#60;blockquote&#62;&#60;p&#62;What is the &#34;challenge&#34; we speak of?  The idea behind the Digging into Data Challenge is to answer the question &#34;what do you do with a million books?&#34;  Or a million pages of newspaper? Or a million photographs of artwork?  That is, how does the notion of scale affect humanities and social science research? Now that scholars have access to huge repositories of digitized data -- far more than they could read in a lifetime -- what does that mean for research?&#60;/p&#62;&#60;/blockquote&#62;</description>
</item>
<item>
<title>MachetEC2: Open Visualization / Big Data Toolkit on Amazon EC2</title>
<link>http://forums.flowingdata.com/topic/machetec2-open-visualization-big-data-toolkit-on-amazon-ec2#post-336</link>
<pubDate>Wed, 28 Jan 2009 23:18:23 +0000</pubDate>
<dc:creator>nathany</dc:creator>
<guid isPermaLink="false">336@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Ah, I get it now
&#60;/p&#62;</description>
</item>
<item>
<title>MachetEC2: Open Visualization / Big Data Toolkit on Amazon EC2</title>
<link>http://forums.flowingdata.com/topic/machetec2-open-visualization-big-data-toolkit-on-amazon-ec2#post-334</link>
<pubDate>Wed, 28 Jan 2009 19:35:43 +0000</pubDate>
<dc:creator>dhruv</dc:creator>
<guid isPermaLink="false">334@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Amazon's &#60;a href=&#34;http://aws.amazon.com/ec2/&#34; rel=&#34;nofollow&#34;&#62;EC2&#60;/a&#62; allows users to instantiate a virtual computer with a pre-installed operating system, software packages, and up to 1 TB of data pre-loaded on disk, ready to work with, from a pre-defined image (an &#34;Amazon Machine Image&#34;, or AMI).&#60;/p&#62;
&#60;p&#62;MachetEC2 is an effort by a group of &#60;a href=&#34;http://infochimps.org&#34; rel=&#34;nofollow&#34;&#62;Infochimps&#60;/a&#62; to create an AMI for data processing, analysis, and visualization.  If you create an instance of MachetEC2, you'll be have an environment with tools designed for working with data ready to go.  All you'll need to do is load in some data and get to work!
&#60;/p&#62;</description>
</item>
<item>
<title>MachetEC2: Open Visualization / Big Data Toolkit on Amazon EC2</title>
<link>http://forums.flowingdata.com/topic/machetec2-open-visualization-big-data-toolkit-on-amazon-ec2#post-332</link>
<pubDate>Wed, 28 Jan 2009 18:43:49 +0000</pubDate>
<dc:creator>nathany</dc:creator>
<guid isPermaLink="false">332@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;So in layman's terms... MachetEC2 would be used to make data storage and retrieval easier with Amazon EC2?
&#60;/p&#62;</description>
</item>
<item>
<title>MachetEC2: Open Visualization / Big Data Toolkit on Amazon EC2</title>
<link>http://forums.flowingdata.com/topic/machetec2-open-visualization-big-data-toolkit-on-amazon-ec2#post-331</link>
<pubDate>Wed, 28 Jan 2009 18:40:20 +0000</pubDate>
<dc:creator>mrflip</dc:creator>
<guid isPermaLink="false">331@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Wow, 15 minutes later and already some great additions so far from &#60;a href=&#34;http://search.twitter.com/search?q=+infochimp+OR+infochimps+OR+machetec2&#34; rel=&#34;nofollow&#34;&#62;twitter&#60;/a&#62;.&#60;/p&#62;
&#60;p&#62;Including:&#60;/p&#62;
&#60;p&#62;* @hackingdata: prefuse/flare&#60;/p&#62;
&#60;p&#62;* @peteskomoroch: @infochimps I did something similar a while back, might want to add ipython, boto. some libs I considered: &#60;a href=&#34;http://tinyurl.com/ckl2zr&#34; rel=&#34;nofollow&#34;&#62;http://tinyurl.com/ckl2zr&#60;/a&#62; (&#38;lt;-- whoa. Awesome.)&#60;/p&#62;
&#60;p&#62;* @dwf: Mayavi2 and its dependency, VTK: Great for 3D viz stuff. PyMC, definitely. matplotlib. Maybe Boost?&#60;/p&#62;
&#60;p&#62;* @grantmichaels: erlang&#60;/p&#62;
&#60;p&#62;* @ogrisel: happy or disco for mapreduce, numpy, scipy, pylab, boto, mdp, lxml, liblinear, cython, gcc, torch5  &#60;/p&#62;
&#60;p&#62;Already more than our finite monkeys can roll out, but with this kind of interest we'll get the ball rolling (or get behind @peteskomoroch's ball and help push) and watch what happens from there.
&#60;/p&#62;</description>
</item>
<item>
<title>MachetEC2: Open Visualization / Big Data Toolkit on Amazon EC2</title>
<link>http://forums.flowingdata.com/topic/machetec2-open-visualization-big-data-toolkit-on-amazon-ec2#post-330</link>
<pubDate>Wed, 28 Jan 2009 17:58:39 +0000</pubDate>
<dc:creator>mrflip</dc:creator>
<guid isPermaLink="false">330@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Hi,&#60;br /&#62;
We're assembling MachetEC2, a free Amazon AWS EC2 instance that comes out of the box with a full suite of data analysis and visualization tools. (Obviously the compute time is on your dime, but the base instance is there for you to grab, for free, and get to hackin')&#60;/p&#62;
&#60;p&#62;What tools do you want to have in there?  We're going to push this up in the next few days -- look for an announcement on the &#60;a href=&#34;blog.infochimps.org&#34; rel=&#34;nofollow&#34;&#62;infochimps blog&#60;/a&#62;, and we'll reply here too. If you'd like to help, set up something you find essential and tweet/email us to 'pull' from your instance. (You'll have to use a credit card to get an instance, but it's so cheap it'll only cost a buck or two; and once it's set up we'll pull it onto our account where everyone can get it.) If there's already something like this floating around, please let us know and we'll work with them instead.&#60;/p&#62;
&#60;p&#62;We'll initially post up some subset of:&#60;/p&#62;
&#60;p&#62;* Ruby, Python, Erlang, R&#60;br /&#62;
* MySQL, PostgreSQL&#60;br /&#62;
* AllegroGraph, CouchDB&#60;br /&#62;
* Hadoop, Hive, Pig&#60;br /&#62;
* Cytoscape, Gruff&#60;br /&#62;
* Processing, Prefuse/Flare, Modest Maps&#60;br /&#62;
* NLTK, SciPy&#60;br /&#62;
  ...&#60;br /&#62;
* YOUR SUGGESTION HERE&#60;/p&#62;
&#60;p&#62;This will fit in with the things we're helping to add to the AWS Public Data Sets. We want you to be able to use infochimps to find something or pull down the Public Data Set of your choice; to load that dataset next to your MachetEC2 instance clone, and be whailing on data from go.&#60;/p&#62;
&#60;p&#62;At some later point we can specialize (MachetEC2-viz, MachetEC2-ML for machine learning folks, MachetEC2-ling for linguistics, etc), so feel free to suggest things of (somewhat) narrow focus, and even more free as I said to set it up and ping us.  Free/Open software only, of cours.
&#60;/p&#62;</description>
</item>

</channel>
</rss>

