<?xml version="1.0" encoding="UTF-8"?><!-- generator="bbPress" -->

<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
>

<channel>
<title>FlowingData Forums &#187; Forum: Statistics and Data - Recent Posts</title>
<link>http://forums.flowingdata.com/</link>
<description>Strength in Numbers</description>
<language>en</language>
<pubDate>Sat, 11 Feb 2012 20:23:37 +0000</pubDate>

<item>
<title>Getting Started w/ your.flowingdata</title>
<link>http://forums.flowingdata.com/topic/getting-started-w-yourflowingdata#post-2525</link>
<pubDate>Wed, 08 Feb 2012 17:54:59 +0000</pubDate>
<dc:creator>eklypse</dc:creator>
<guid isPermaLink="false">2525@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;I'd like to use yfd on a protected account. That is, anyone who wants to follow me must submit a request which I can then approve. I hoped that whatever automated feature that would cause yfd to follow back would thus generate such a request which I could then approve, but as yet, no request has appeared. I started following @yfd a couple hours ago, so I figured the request would appear by now. Is there anything I can do to make this work (short of turning off the privacy on my account)? Thanks in advance!
&#60;/p&#62;</description>
</item>
<item>
<title>Looking for a Starbucks calculator</title>
<link>http://forums.flowingdata.com/topic/looking-for-a-starbucks-calculator#post-2468</link>
<pubDate>Wed, 04 Jan 2012 07:25:13 +0000</pubDate>
<dc:creator>brcire</dc:creator>
<guid isPermaLink="false">2468@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;In search of a Starbucks calculator that shows/graphs how much you spend per week, month, per year(s) and runs a parallel result of how that money would look if you invested it at standard interest rates.&#60;/p&#62;
&#60;p&#62;I recall one several years ago but cannot find it.&#60;/p&#62;
&#60;p&#62;Thanks for your help!&#60;br /&#62;
Eric
&#60;/p&#62;</description>
</item>
<item>
<title>Where to find worldwide sunrise and sunset data set?</title>
<link>http://forums.flowingdata.com/topic/where-to-find-worldwide-sunrise-and-sunset-data-set#post-2460</link>
<pubDate>Fri, 30 Dec 2011 13:36:00 +0000</pubDate>
<dc:creator>nathany</dc:creator>
<guid isPermaLink="false">2460@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Weather Underground has that info, but you'll have to do some scraping.
&#60;/p&#62;</description>
</item>
<item>
<title>Where to find worldwide sunrise and sunset data set?</title>
<link>http://forums.flowingdata.com/topic/where-to-find-worldwide-sunrise-and-sunset-data-set#post-2459</link>
<pubDate>Thu, 29 Dec 2011 23:22:54 +0000</pubDate>
<dc:creator>patrickberkeley</dc:creator>
<guid isPermaLink="false">2459@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Hi, does anyone know where I could find an open data set containing worldwide sunrise and sunset times? Thanks, Patrick
&#60;/p&#62;</description>
</item>
<item>
<title>Available:  CRS, Health Insurance Coverage by State and Congressional District</title>
<link>http://forums.flowingdata.com/topic/available-crs-health-insurance-coverage-by-state-and-congressional-district#post-2390</link>
<pubDate>Mon, 24 Oct 2011 19:26:26 +0000</pubDate>
<dc:creator>prescottrjp</dc:creator>
<guid isPermaLink="false">2390@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Available:&#60;br /&#62;
&#60;a href=&#34;http://goo.gl/WgKu5&#34; rel=&#34;nofollow&#34;&#62;http://goo.gl/WgKu5&#60;/a&#62;&#60;br /&#62;
Congressional Research Service, Health Insurance Coverage by State and Congressional District;&#60;br /&#62;
&#60;a href=&#34;http://goo.gl/WgKu5&#34; rel=&#34;nofollow&#34;&#62;http://goo.gl/WgKu5&#60;/a&#62;&#60;/p&#62;
&#60;p&#62;Summary&#60;br /&#62;
The total U.S. civilian non-institutionalized population in 2010 was estimated to be slightly more than 304 million. Roughly 84.5% of the U.S. civilian non-institutionalized population had one or more forms of health insurance, while 15.5%, or roughly 47.2 million, were uninsured. The most common form of insurance was employer provided.&#60;br /&#62;
This report employs the U.S. Census Bureau’s 2010 American Community Survey (ACS) to describe health insurance coverage and provide estimates of coverage by type of coverage at the national, state, and congressional district level. The ACS survey has a sample of more than 2 million respondents and solicits health insurance coverage information as of the date of the survey. The sample is large enough to provide accurate estimates of coverage at the congressional district level at a point in time. As this report details, there are considerable differences across states, within states, and across demographic groups in the proportion of insured and their sources of coverage.
&#60;/p&#62;</description>
</item>
<item>
<title>Creating matrix for heatmap in R</title>
<link>http://forums.flowingdata.com/topic/creating-matrix-for-heatmap-in-r#post-2388</link>
<pubDate>Fri, 21 Oct 2011 01:17:16 +0000</pubDate>
<dc:creator>Dwash</dc:creator>
<guid isPermaLink="false">2388@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;I add my datasets to R in txt file in format shown above.&#60;/p&#62;
&#60;p&#62;name &#38;lt;- read.csv(&#34;name.txt&#34;, header=TRUE)&#60;/p&#62;
&#60;p&#62;Is this what are are wanting to know?&#60;/p&#62;
&#60;p&#62;I then use&#60;/p&#62;
&#60;p&#62;attach(name)&#60;/p&#62;
&#60;p&#62;This allows for great ease of columns in datasets.
&#60;/p&#62;</description>
</item>
<item>
<title>looking for a readable management (?) blog</title>
<link>http://forums.flowingdata.com/topic/looking-for-a-readable-management-blog#post-2359</link>
<pubDate>Fri, 30 Sep 2011 00:05:36 +0000</pubDate>
<dc:creator>c0nsole</dc:creator>
<guid isPermaLink="false">2359@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;In a lot of cases ETL is extracting information from a flat file (in the export step). You might read up on concepts of low-level methods of reading a file character by character.  &#60;/p&#62;
&#60;p&#62;For the transform step, you should read up on datatypes with focus on conversions from character strings to numbers and vice versa.  The first chapter of any C or C++ book should prepare you.&#60;/p&#62;
&#60;p&#62;For the load step, you might take the tutorials at w3schools.org to learn SQL.  From there, any book on relational databases will prepare you for what you would see at a fortune 500 company's databases.&#60;/p&#62;
&#60;p&#62;For management/leadership info, iTunes University has a tremendous amount of leadership podcasts that are free.  I download several at a time and listen to them on my commute to work.&#60;/p&#62;
&#60;p&#62;-c0nsole
&#60;/p&#62;</description>
</item>
<item>
<title>looking for a readable management (?) blog</title>
<link>http://forums.flowingdata.com/topic/looking-for-a-readable-management-blog#post-2358</link>
<pubDate>Mon, 26 Sep 2011 18:12:35 +0000</pubDate>
<dc:creator>justin_abraham</dc:creator>
<guid isPermaLink="false">2358@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;I am a mid-level manager at a Fortune 500 company with some time on my hands this quarter (don't ask). I am trying to broaden my horizons online, but everything I find is either:&#60;/p&#62;
&#60;p&#62;a) really interesting analytics/visualization stuff, which is sadly not my job&#60;br /&#62;
b) disgruntled developers channeling Dilbert&#60;br /&#62;
c) management or IT consultants trying to sell their wares with unreadable recycled jargon&#60;/p&#62;
&#60;p&#62;Will someone please recommend something as engaging, simple, and thought provoking as this site, but geared towards, say data warehousing or ETL? or even management? or are those topics just inherently boring (to write about)?
&#60;/p&#62;</description>
</item>
<item>
<title>Creating matrix for heatmap in R</title>
<link>http://forums.flowingdata.com/topic/creating-matrix-for-heatmap-in-r#post-2342</link>
<pubDate>Tue, 13 Sep 2011 13:31:42 +0000</pubDate>
<dc:creator>nathany</dc:creator>
<guid isPermaLink="false">2342@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;I do most of my data munging in Python, but I hear the reshape package in R, by Hadley Wickham is pretty good for this.
&#60;/p&#62;</description>
</item>
<item>
<title>Creating matrix for heatmap in R</title>
<link>http://forums.flowingdata.com/topic/creating-matrix-for-heatmap-in-r#post-2339</link>
<pubDate>Tue, 13 Sep 2011 04:27:54 +0000</pubDate>
<dc:creator>harald</dc:creator>
<guid isPermaLink="false">2339@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;I followed the excellent FlowingData tutorial &#34;How to Make a Heatmap – a Quick and Easy Solution&#34;: &#60;a href=&#34;http://flowingdata.com/2010/01/21/how-to-make-a-heatmap-a-quick-and-easy-solution/&#34; rel=&#34;nofollow&#34;&#62;http://flowingdata.com/2010/01/21/how-to-make-a-heatmap-a-quick-and-easy-solution/&#60;/a&#62;&#60;/p&#62;
&#60;p&#62;In the tutorial, the baseball dataset was already prepared as a matrix with players as rows and columns with various points. What is the easiest way of creating a dataset like the baseball dataset &#60;a href=&#34;http://datasets.flowingdata.com/ppg2008.csv&#34; rel=&#34;nofollow&#34;&#62;http://datasets.flowingdata.com/ppg2008.csv&#60;/a&#62; in R if the data has the format: &#60;/p&#62;
&#60;p&#62;Dwyane Wade, G, 79&#60;br /&#62;
Dwyane Wade, MIN, 38.6&#60;br /&#62;
Dwyane Wade, PTS, 30.2&#60;br /&#62;
Dwyane Wade, FGM, 10.8&#60;br /&#62;
Dwyane Wade, FGA, 22&#60;br /&#62;
Dwyane Wade, FGP, 0.491&#60;br /&#62;
Dwyane Wade, FTM, 7.5&#60;br /&#62;
Dwyane Wade, FTA, 9.8&#60;br /&#62;
Dwyane Wade, FTP, 0.765&#60;br /&#62;
Dwyane Wade, 3PM, 1.1&#60;br /&#62;
Dwyane Wade, 3PA, 3.5&#60;br /&#62;
Dwyane Wade, 3PP, 0.317&#60;br /&#62;
Dwyane Wade, ORB, 1.1&#60;br /&#62;
Dwyane Wade, DRB, 3.9&#60;br /&#62;
Dwyane Wade, TRB, 5&#60;br /&#62;
Dwyane Wade, AST, 7.5&#60;br /&#62;
Dwyane Wade, STL, 2.2&#60;br /&#62;
Dwyane Wade, BLK, 1.3&#60;br /&#62;
Dwyane Wade, TO, 3.4&#60;br /&#62;
Dwyane Wade, PF , 2.3&#60;br /&#62;
LeBron James, G, 81&#60;br /&#62;
LeBron James, MIN, 37.7&#60;br /&#62;
LeBron James, PTS, 28.4&#60;br /&#62;
LeBron James, FGM, 9.7&#60;br /&#62;
LeBron James, FGA, 19.9&#60;br /&#62;
LeBron James, FGP, 0.489&#60;br /&#62;
LeBron James, FTM, 7.3&#60;br /&#62;
LeBron James, FTA, 9.4&#60;br /&#62;
LeBron James, FTP, 0.78&#60;br /&#62;
LeBron James, 3PM, 1.6&#60;br /&#62;
LeBron James, 3PA, 4.7&#60;br /&#62;
LeBron James, 3PP, 0.344&#60;br /&#62;
LeBron James, ORB, 1.3&#60;br /&#62;
LeBron James, DRB, 6.3&#60;br /&#62;
LeBron James, TRB, 7.6&#60;br /&#62;
LeBron James, AST, 7.2&#60;br /&#62;
LeBron James, STL, 1.7&#60;br /&#62;
LeBron James, BLK, 1.1&#60;br /&#62;
LeBron James, TO, 3&#60;br /&#62;
LeBron James, PF, 1.7&#60;br /&#62;
....Kobe Bryant ...cont....&#60;br /&#62;
...cont....&#60;/p&#62;
&#60;p&#62;Doing such a transformation is straightforward in SQL or even a spreadsheet, but it would be great to know how to get it right in R.
&#60;/p&#62;</description>
</item>
<item>
<title>Android app / widget?</title>
<link>http://forums.flowingdata.com/topic/android-app-widget#post-2298</link>
<pubDate>Wed, 10 Aug 2011 05:00:15 +0000</pubDate>
<dc:creator>snataas</dc:creator>
<guid isPermaLink="false">2298@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Does anyone know of plans for an Android app, ideally with a widget for simple entry into your.flowingdata.com?&#60;br /&#62;
I know there is one on github but it seems not to be available in the market.&#60;br /&#62;
Thanks.
&#60;/p&#62;</description>
</item>
<item>
<title>Developer Contest with Infochimps data</title>
<link>http://forums.flowingdata.com/topic/developer-contest-with-infochimps-data#post-2250</link>
<pubDate>Tue, 05 Jul 2011 10:38:58 +0000</pubDate>
<dc:creator>nathany</dc:creator>
<guid isPermaLink="false">2250@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Build a quick app with Infochimps data and win the Tufte books:&#60;/p&#62;
&#60;p&#62;&#60;a href=&#34;http://blog.infochimps.com/2011/07/04/new-developer-contest-twilio-infochimps/&#34; rel=&#34;nofollow&#34;&#62;http://blog.infochimps.com/2011/07/04/new-developer-contest-twilio-infochimps/&#60;/a&#62;
&#60;/p&#62;</description>
</item>
<item>
<title>2010 Census Data got me down</title>
<link>http://forums.flowingdata.com/topic/2010-census-data-got-me-down#post-2232</link>
<pubDate>Wed, 15 Jun 2011 10:40:39 +0000</pubDate>
<dc:creator>nathany</dc:creator>
<guid isPermaLink="false">2232@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;I'm not familiar with Access, but the best place to start looking for Census data is the FactFinder:&#60;/p&#62;
&#60;p&#62;&#60;a href=&#34;http://factfinder2.census.gov/faces/nav/jsf/pages/index.xhtml&#34; rel=&#34;nofollow&#34;&#62;http://factfinder2.census.gov/faces/nav/jsf/pages/index.xhtml&#60;/a&#62;&#60;/p&#62;
&#60;p&#62;Start People, and then Age &#38;amp; Sex. Then you can download the data as CSV.
&#60;/p&#62;</description>
</item>
<item>
<title>2010 Census Data got me down</title>
<link>http://forums.flowingdata.com/topic/2010-census-data-got-me-down#post-2231</link>
<pubDate>Tue, 14 Jun 2011 23:14:44 +0000</pubDate>
<dc:creator>c0nsole</dc:creator>
<guid isPermaLink="false">2231@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;The doc at &#60;a href=&#34;http://www2.census.gov/census_2010/03-Demographic_Profile/Alabama/0README_DPSF.pdf&#34; rel=&#34;nofollow&#34;&#62;http://www2.census.gov/census_2010/03-Demographic_Profile/Alabama/0README_DPSF.pdf&#60;/a&#62; mentions that all numeric fields need to be imported into Access as long integers.  Give it a try that way if you haven't already done so.  You can get a list of what is considered numeric (labeled as N) in chapter 4 of the doc at &#60;a href=&#34;http://www.census.gov/prod/cen2010/doc/dpsf.pdf&#34; rel=&#34;nofollow&#34;&#62;http://www.census.gov/prod/cen2010/doc/dpsf.pdf&#60;/a&#62;&#60;/p&#62;
&#60;p&#62;I haven't checked the file sizes of these datasets, but thought it would be good to note that if you append all of the states's data together, keep in the back of your mind there is a 2GB size limitation in Access.&#60;/p&#62;
&#60;p&#62;As far as other options, you could create a quick pivot table in Excel 2007 if you have less than one million rows.  If the dataset is large, writing a quick perl script might be the way to go if you are familiar with the language.  Let us know how it turns out.  Good luck with your project!
&#60;/p&#62;</description>
</item>
<item>
<title>2010 Census Data got me down</title>
<link>http://forums.flowingdata.com/topic/2010-census-data-got-me-down#post-2230</link>
<pubDate>Tue, 14 Jun 2011 12:47:04 +0000</pubDate>
<dc:creator>joemurphy</dc:creator>
<guid isPermaLink="false">2230@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;All I want is to get average age by zip/tract but I'm lost. I've tried getting the 2010 census demogrphic profiles into access, but was left wanting to throw my computer out the window. &#60;/p&#62;
&#60;p&#62;Advice?&#60;/p&#62;
&#60;p&#62;&#60;a href=&#34;http://2010.census.gov/2010census/data/&#34; rel=&#34;nofollow&#34;&#62;http://2010.census.gov/2010census/data/&#60;/a&#62;
&#60;/p&#62;</description>
</item>
<item>
<title>Medicare Claims Data Challenge</title>
<link>http://forums.flowingdata.com/topic/medicare-claims-data-challenge#post-2220</link>
<pubDate>Fri, 03 Jun 2011 01:17:23 +0000</pubDate>
<dc:creator>nathany</dc:creator>
<guid isPermaLink="false">2220@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Looks like it might be a good one:&#60;/p&#62;
&#60;p&#62;&#60;a href=&#34;http://www.health2challenge.org/2011/06/01/medicare-claims-data/&#34; rel=&#34;nofollow&#34;&#62;http://www.health2challenge.org/2011/06/01/medicare-claims-data/&#60;/a&#62;&#60;/p&#62;
&#60;blockquote&#62;&#60;p&#62;IMPAQ International (IMPAQ) and NORC at the University of Chicago wish to promote the development of internet-based applications for users of the newly released Medicare Claims Public Use Files. IMPAQ and NORC, with support from Centers for Medicare and Medicaid Services (CMS), seek to develop web platforms suited for health services researchers, health policy researchers, data entrepreneurs, and/or other potential users who would like to access Medicare claims data. This online event challenges interdisciplinary teams to create a tool for researchers that will present two or more of the new Basic Stand Alone (BSA) Medicare Claims Public Use Files (PFUs) in a way that’s informative, interactive, and user friendly.&#60;/p&#62;&#60;/blockquote&#62;
&#60;p&#62;Prize: $7,500 and 2 tickets to Health 2.0 conference. &#60;/p&#62;
&#60;p&#62;Deadline: August 15
&#60;/p&#62;</description>
</item>
<item>
<title>Best graph/visualization for travel time reliability?</title>
<link>http://forums.flowingdata.com/topic/best-graphvisualization-for-travel-time-reliability#post-2213</link>
<pubDate>Thu, 26 May 2011 17:17:49 +0000</pubDate>
<dc:creator>mikev</dc:creator>
<guid isPermaLink="false">2213@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Can you describe the travel time measure a little bit more?  I've used excel to show project completion distributions. Basically, I we are graded on completion times for projects relative to their expected completion date at the beginning of the project.  It has to be finished within -10% and + 5% of the projects expected completion date. It's rather hard to explain in text, but I'd be happy to send you an email with an example.
&#60;/p&#62;</description>
</item>
<item>
<title>Gas Prices to Energy Company Stock Prices</title>
<link>http://forums.flowingdata.com/topic/gas-prices-to-energy-company-stock-prices#post-2199</link>
<pubDate>Wed, 11 May 2011 01:36:10 +0000</pubDate>
<dc:creator>bmdickey17</dc:creator>
<guid isPermaLink="false">2199@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Yes, I agree. I always thought an Integrated Gas Company's stock price would fluctuate based on the cost of Crude Oil, but there doesn't seem as strong of a relationship between Crude Oil Price Per Barrel and Gas Prices at the Pump. It makes me curious since there seems to be a stronger relationship between pump prices and stock prices than pump prices and crude oil. Obviously, they are all correlated, but it is interesting to see where they deviate. I haven't finished yet, but just wanted to gather others thoughts. Thanks.
&#60;/p&#62;</description>
</item>
<item>
<title>Gas Prices to Energy Company Stock Prices</title>
<link>http://forums.flowingdata.com/topic/gas-prices-to-energy-company-stock-prices#post-2196</link>
<pubDate>Tue, 03 May 2011 11:43:06 +0000</pubDate>
<dc:creator>nathany</dc:creator>
<guid isPermaLink="false">2196@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Interesting. I have a feeling the two are tightly coupled... like the two are affected by many of the same factors.
&#60;/p&#62;</description>
</item>
<item>
<title>Best graph/visualization for travel time reliability?</title>
<link>http://forums.flowingdata.com/topic/best-graphvisualization-for-travel-time-reliability#post-2195</link>
<pubDate>Mon, 02 May 2011 13:07:42 +0000</pubDate>
<dc:creator>LisaSmith</dc:creator>
<guid isPermaLink="false">2195@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;I've been tasked with creating a &#34;dashboard&#34; for performance measures. In the past we've just used Excel charts. Does anyone have an interesting way to demonstrate reliability using Excel and/or Xcelsius?
&#60;/p&#62;</description>
</item>

</channel>
</rss>

