<?xml version="1.0" encoding="UTF-8"?><!-- generator="bbPress" -->

<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
>

<channel>
<title>FlowingData Forums &#187; Forum: Statistics and Data - Recent Topics</title>
<link>http://forums.flowingdata.com/</link>
<description>Strength in Numbers</description>
<language>en</language>
<pubDate>Sat, 11 Feb 2012 22:04:14 +0000</pubDate>

<item>
<title>Getting Started w/ your.flowingdata</title>
<link>http://forums.flowingdata.com/topic/getting-started-w-yourflowingdata#post-2525</link>
<pubDate>Wed, 08 Feb 2012 17:54:59 +0000</pubDate>
<dc:creator>eklypse</dc:creator>
<guid isPermaLink="false">2525@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;I'd like to use yfd on a protected account. That is, anyone who wants to follow me must submit a request which I can then approve. I hoped that whatever automated feature that would cause yfd to follow back would thus generate such a request which I could then approve, but as yet, no request has appeared. I started following @yfd a couple hours ago, so I figured the request would appear by now. Is there anything I can do to make this work (short of turning off the privacy on my account)? Thanks in advance!
&#60;/p&#62;</description>
</item>
<item>
<title>Looking for a Starbucks calculator</title>
<link>http://forums.flowingdata.com/topic/looking-for-a-starbucks-calculator#post-2468</link>
<pubDate>Wed, 04 Jan 2012 07:25:13 +0000</pubDate>
<dc:creator>brcire</dc:creator>
<guid isPermaLink="false">2468@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;In search of a Starbucks calculator that shows/graphs how much you spend per week, month, per year(s) and runs a parallel result of how that money would look if you invested it at standard interest rates.&#60;/p&#62;
&#60;p&#62;I recall one several years ago but cannot find it.&#60;/p&#62;
&#60;p&#62;Thanks for your help!&#60;br /&#62;
Eric
&#60;/p&#62;</description>
</item>
<item>
<title>Where to find worldwide sunrise and sunset data set?</title>
<link>http://forums.flowingdata.com/topic/where-to-find-worldwide-sunrise-and-sunset-data-set#post-2459</link>
<pubDate>Thu, 29 Dec 2011 23:22:54 +0000</pubDate>
<dc:creator>patrickberkeley</dc:creator>
<guid isPermaLink="false">2459@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Hi, does anyone know where I could find an open data set containing worldwide sunrise and sunset times? Thanks, Patrick
&#60;/p&#62;</description>
</item>
<item>
<title>Available:  CRS, Health Insurance Coverage by State and Congressional District</title>
<link>http://forums.flowingdata.com/topic/available-crs-health-insurance-coverage-by-state-and-congressional-district#post-2390</link>
<pubDate>Mon, 24 Oct 2011 19:26:26 +0000</pubDate>
<dc:creator>prescottrjp</dc:creator>
<guid isPermaLink="false">2390@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Available:&#60;br /&#62;
&#60;a href=&#34;http://goo.gl/WgKu5&#34; rel=&#34;nofollow&#34;&#62;http://goo.gl/WgKu5&#60;/a&#62;&#60;br /&#62;
Congressional Research Service, Health Insurance Coverage by State and Congressional District;&#60;br /&#62;
&#60;a href=&#34;http://goo.gl/WgKu5&#34; rel=&#34;nofollow&#34;&#62;http://goo.gl/WgKu5&#60;/a&#62;&#60;/p&#62;
&#60;p&#62;Summary&#60;br /&#62;
The total U.S. civilian non-institutionalized population in 2010 was estimated to be slightly more than 304 million. Roughly 84.5% of the U.S. civilian non-institutionalized population had one or more forms of health insurance, while 15.5%, or roughly 47.2 million, were uninsured. The most common form of insurance was employer provided.&#60;br /&#62;
This report employs the U.S. Census Bureau’s 2010 American Community Survey (ACS) to describe health insurance coverage and provide estimates of coverage by type of coverage at the national, state, and congressional district level. The ACS survey has a sample of more than 2 million respondents and solicits health insurance coverage information as of the date of the survey. The sample is large enough to provide accurate estimates of coverage at the congressional district level at a point in time. As this report details, there are considerable differences across states, within states, and across demographic groups in the proportion of insured and their sources of coverage.
&#60;/p&#62;</description>
</item>
<item>
<title>Creating matrix for heatmap in R</title>
<link>http://forums.flowingdata.com/topic/creating-matrix-for-heatmap-in-r#post-2339</link>
<pubDate>Tue, 13 Sep 2011 04:27:54 +0000</pubDate>
<dc:creator>harald</dc:creator>
<guid isPermaLink="false">2339@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;I followed the excellent FlowingData tutorial &#34;How to Make a Heatmap – a Quick and Easy Solution&#34;: &#60;a href=&#34;http://flowingdata.com/2010/01/21/how-to-make-a-heatmap-a-quick-and-easy-solution/&#34; rel=&#34;nofollow&#34;&#62;http://flowingdata.com/2010/01/21/how-to-make-a-heatmap-a-quick-and-easy-solution/&#60;/a&#62;&#60;/p&#62;
&#60;p&#62;In the tutorial, the baseball dataset was already prepared as a matrix with players as rows and columns with various points. What is the easiest way of creating a dataset like the baseball dataset &#60;a href=&#34;http://datasets.flowingdata.com/ppg2008.csv&#34; rel=&#34;nofollow&#34;&#62;http://datasets.flowingdata.com/ppg2008.csv&#60;/a&#62; in R if the data has the format: &#60;/p&#62;
&#60;p&#62;Dwyane Wade, G, 79&#60;br /&#62;
Dwyane Wade, MIN, 38.6&#60;br /&#62;
Dwyane Wade, PTS, 30.2&#60;br /&#62;
Dwyane Wade, FGM, 10.8&#60;br /&#62;
Dwyane Wade, FGA, 22&#60;br /&#62;
Dwyane Wade, FGP, 0.491&#60;br /&#62;
Dwyane Wade, FTM, 7.5&#60;br /&#62;
Dwyane Wade, FTA, 9.8&#60;br /&#62;
Dwyane Wade, FTP, 0.765&#60;br /&#62;
Dwyane Wade, 3PM, 1.1&#60;br /&#62;
Dwyane Wade, 3PA, 3.5&#60;br /&#62;
Dwyane Wade, 3PP, 0.317&#60;br /&#62;
Dwyane Wade, ORB, 1.1&#60;br /&#62;
Dwyane Wade, DRB, 3.9&#60;br /&#62;
Dwyane Wade, TRB, 5&#60;br /&#62;
Dwyane Wade, AST, 7.5&#60;br /&#62;
Dwyane Wade, STL, 2.2&#60;br /&#62;
Dwyane Wade, BLK, 1.3&#60;br /&#62;
Dwyane Wade, TO, 3.4&#60;br /&#62;
Dwyane Wade, PF , 2.3&#60;br /&#62;
LeBron James, G, 81&#60;br /&#62;
LeBron James, MIN, 37.7&#60;br /&#62;
LeBron James, PTS, 28.4&#60;br /&#62;
LeBron James, FGM, 9.7&#60;br /&#62;
LeBron James, FGA, 19.9&#60;br /&#62;
LeBron James, FGP, 0.489&#60;br /&#62;
LeBron James, FTM, 7.3&#60;br /&#62;
LeBron James, FTA, 9.4&#60;br /&#62;
LeBron James, FTP, 0.78&#60;br /&#62;
LeBron James, 3PM, 1.6&#60;br /&#62;
LeBron James, 3PA, 4.7&#60;br /&#62;
LeBron James, 3PP, 0.344&#60;br /&#62;
LeBron James, ORB, 1.3&#60;br /&#62;
LeBron James, DRB, 6.3&#60;br /&#62;
LeBron James, TRB, 7.6&#60;br /&#62;
LeBron James, AST, 7.2&#60;br /&#62;
LeBron James, STL, 1.7&#60;br /&#62;
LeBron James, BLK, 1.1&#60;br /&#62;
LeBron James, TO, 3&#60;br /&#62;
LeBron James, PF, 1.7&#60;br /&#62;
....Kobe Bryant ...cont....&#60;br /&#62;
...cont....&#60;/p&#62;
&#60;p&#62;Doing such a transformation is straightforward in SQL or even a spreadsheet, but it would be great to know how to get it right in R.
&#60;/p&#62;</description>
</item>
<item>
<title>looking for a readable management (?) blog</title>
<link>http://forums.flowingdata.com/topic/looking-for-a-readable-management-blog#post-2358</link>
<pubDate>Mon, 26 Sep 2011 18:12:35 +0000</pubDate>
<dc:creator>justin_abraham</dc:creator>
<guid isPermaLink="false">2358@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;I am a mid-level manager at a Fortune 500 company with some time on my hands this quarter (don't ask). I am trying to broaden my horizons online, but everything I find is either:&#60;/p&#62;
&#60;p&#62;a) really interesting analytics/visualization stuff, which is sadly not my job&#60;br /&#62;
b) disgruntled developers channeling Dilbert&#60;br /&#62;
c) management or IT consultants trying to sell their wares with unreadable recycled jargon&#60;/p&#62;
&#60;p&#62;Will someone please recommend something as engaging, simple, and thought provoking as this site, but geared towards, say data warehousing or ETL? or even management? or are those topics just inherently boring (to write about)?
&#60;/p&#62;</description>
</item>
<item>
<title>Android app / widget?</title>
<link>http://forums.flowingdata.com/topic/android-app-widget#post-2298</link>
<pubDate>Wed, 10 Aug 2011 05:00:15 +0000</pubDate>
<dc:creator>snataas</dc:creator>
<guid isPermaLink="false">2298@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Does anyone know of plans for an Android app, ideally with a widget for simple entry into your.flowingdata.com?&#60;br /&#62;
I know there is one on github but it seems not to be available in the market.&#60;br /&#62;
Thanks.
&#60;/p&#62;</description>
</item>
<item>
<title>Developer Contest with Infochimps data</title>
<link>http://forums.flowingdata.com/topic/developer-contest-with-infochimps-data#post-2250</link>
<pubDate>Tue, 05 Jul 2011 10:38:58 +0000</pubDate>
<dc:creator>nathany</dc:creator>
<guid isPermaLink="false">2250@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Build a quick app with Infochimps data and win the Tufte books:&#60;/p&#62;
&#60;p&#62;&#60;a href=&#34;http://blog.infochimps.com/2011/07/04/new-developer-contest-twilio-infochimps/&#34; rel=&#34;nofollow&#34;&#62;http://blog.infochimps.com/2011/07/04/new-developer-contest-twilio-infochimps/&#60;/a&#62;
&#60;/p&#62;</description>
</item>
<item>
<title>2010 Census Data got me down</title>
<link>http://forums.flowingdata.com/topic/2010-census-data-got-me-down#post-2230</link>
<pubDate>Tue, 14 Jun 2011 12:47:04 +0000</pubDate>
<dc:creator>joemurphy</dc:creator>
<guid isPermaLink="false">2230@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;All I want is to get average age by zip/tract but I'm lost. I've tried getting the 2010 census demogrphic profiles into access, but was left wanting to throw my computer out the window. &#60;/p&#62;
&#60;p&#62;Advice?&#60;/p&#62;
&#60;p&#62;&#60;a href=&#34;http://2010.census.gov/2010census/data/&#34; rel=&#34;nofollow&#34;&#62;http://2010.census.gov/2010census/data/&#60;/a&#62;
&#60;/p&#62;</description>
</item>
<item>
<title>Medicare Claims Data Challenge</title>
<link>http://forums.flowingdata.com/topic/medicare-claims-data-challenge#post-2220</link>
<pubDate>Fri, 03 Jun 2011 01:17:23 +0000</pubDate>
<dc:creator>nathany</dc:creator>
<guid isPermaLink="false">2220@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Looks like it might be a good one:&#60;/p&#62;
&#60;p&#62;&#60;a href=&#34;http://www.health2challenge.org/2011/06/01/medicare-claims-data/&#34; rel=&#34;nofollow&#34;&#62;http://www.health2challenge.org/2011/06/01/medicare-claims-data/&#60;/a&#62;&#60;/p&#62;
&#60;blockquote&#62;&#60;p&#62;IMPAQ International (IMPAQ) and NORC at the University of Chicago wish to promote the development of internet-based applications for users of the newly released Medicare Claims Public Use Files. IMPAQ and NORC, with support from Centers for Medicare and Medicaid Services (CMS), seek to develop web platforms suited for health services researchers, health policy researchers, data entrepreneurs, and/or other potential users who would like to access Medicare claims data. This online event challenges interdisciplinary teams to create a tool for researchers that will present two or more of the new Basic Stand Alone (BSA) Medicare Claims Public Use Files (PFUs) in a way that’s informative, interactive, and user friendly.&#60;/p&#62;&#60;/blockquote&#62;
&#60;p&#62;Prize: $7,500 and 2 tickets to Health 2.0 conference. &#60;/p&#62;
&#60;p&#62;Deadline: August 15
&#60;/p&#62;</description>
</item>
<item>
<title>Best graph/visualization for travel time reliability?</title>
<link>http://forums.flowingdata.com/topic/best-graphvisualization-for-travel-time-reliability#post-2195</link>
<pubDate>Mon, 02 May 2011 13:07:42 +0000</pubDate>
<dc:creator>LisaSmith</dc:creator>
<guid isPermaLink="false">2195@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;I've been tasked with creating a &#34;dashboard&#34; for performance measures. In the past we've just used Excel charts. Does anyone have an interesting way to demonstrate reliability using Excel and/or Xcelsius?
&#60;/p&#62;</description>
</item>
<item>
<title>Gas Prices to Energy Company Stock Prices</title>
<link>http://forums.flowingdata.com/topic/gas-prices-to-energy-company-stock-prices#post-2194</link>
<pubDate>Sun, 01 May 2011 15:35:22 +0000</pubDate>
<dc:creator>bmdickey17</dc:creator>
<guid isPermaLink="false">2194@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;I put together an Excel doc with a Graph that showed a 91% correlation between Gas Prices and Chevron's Stock Price from Dec 2009 until Present. Has anyone else done any analysis like this or have any similar numbers? I can post my Excel Doc on Google Docs for everyone to review. I would love your feedback. Thanks.&#60;/p&#62;
&#60;p&#62;I did have problems formatting the data on Google Docs, I will try to work out the issues, but let me know what you think.&#60;/p&#62;
&#60;p&#62;&#60;a href=&#34;https://spreadsheets.google.com/ccc?key=0AlEhmjhIhqtZdFg2dFlrYjlmaGEweW5HeERaWVJubVE&#38;amp;hl=en&#34; rel=&#34;nofollow&#34;&#62;https://spreadsheets.google.com/ccc?key=0AlEhmjhIhqtZdFg2dFlrYjlmaGEweW5HeERaWVJubVE&#38;amp;hl=en&#60;/a&#62;
&#60;/p&#62;</description>
</item>
<item>
<title>When does amnt of skew/prevalence of outliers make median preferable over mean?</title>
<link>http://forums.flowingdata.com/topic/when-does-amnt-of-skewprevalence-of-outliers-make-median-preferable-over-mean#post-2190</link>
<pubDate>Thu, 28 Apr 2011 01:55:17 +0000</pubDate>
<dc:creator>mehta_april</dc:creator>
<guid isPermaLink="false">2190@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;I know that &#34;deviations in the data are devil&#34;, and when the distribution is highly skewed, it is better to consider median as average rather than mean, but how to decide these hard-limits.&#60;/p&#62;
&#60;p&#62;For example:&#60;/p&#62;
&#60;p&#62;* CASE 1:&#60;br /&#62;
   *  Assume X = 10,20,30,40,50,60,70&#60;br /&#62;
   * In this case, I think that it is better to use mean and that it will give very accurate results.&#60;/p&#62;
&#60;p&#62;* CASE 2:&#60;br /&#62;
   * Assume X = 10,20,30,40,50,60,70,7000&#60;br /&#62;
   * In this case, I think that it is better to use median instead of using the mean.&#60;/p&#62;
&#60;p&#62;* CASE 3:&#60;br /&#62;
   * Assume X = 10,20,30,400,500,600,700&#60;br /&#62;
   * In this case, I think it is better to use IQR (Inter Quartile Range)&#60;/p&#62;
&#60;p&#62;But I'm stuck with how to decide these hard-limits i.e. which to use in which condition, in general. &#60;/p&#62;
&#60;p&#62;I've found a tool working on subjected principle, which takes context-less sample-distribution as input and determines whether mean is close/moderate or against the null-hypothesis.&#60;/p&#62;
&#60;p&#62;Find References:- &#60;/p&#62;
&#60;p&#62;* &#60;a href=&#34;http://home.ubalt.edu/ntsbarsh/Business-stat/otherapplets/MeanTest.htm&#34; rel=&#34;nofollow&#34;&#62;http://home.ubalt.edu/ntsbarsh/Business-stat/otherapplets/MeanTest.htm&#60;/a&#62; (For Mean Test)&#60;br /&#62;
* &#60;a href=&#34;http://home.ubalt.edu/ntsbarsh/Business-stat/otherapplets/MediansTest.htm&#34; rel=&#34;nofollow&#34;&#62;http://home.ubalt.edu/ntsbarsh/Business-stat/otherapplets/MediansTest.htm&#60;/a&#62; (For Median Test)&#60;/p&#62;
&#60;p&#62;What I'm really looking is a good answer which states how to derive these conclusions.&#60;/p&#62;
&#60;p&#62;Any help or learning pointers will be highly appreciated.
&#60;/p&#62;</description>
</item>
<item>
<title>Voronoi</title>
<link>http://forums.flowingdata.com/topic/voronoi#post-2177</link>
<pubDate>Fri, 15 Apr 2011 17:41:41 +0000</pubDate>
<dc:creator>frmous</dc:creator>
<guid isPermaLink="false">2177@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;I'm designing a UI that lets the user determine the proportions of 4 items. (possibly more in the future)&#60;/p&#62;
&#60;p&#62;I was impressed with all the UI's that let me build voronoi cells from points.  Unfortunately, my user needs it easy. Drag on a cell, cell inflates, others deflate. Fin! &#60;/p&#62;
&#60;p&#62;The problem is I can't figure out a way to give my code a ratio and have it build a polygon based on how much area it should take up. &#60;/p&#62;
&#60;p&#62;Any ideas?  I could end up doing the rectangular tree map instead.  That seems easier to code,  too bad the voronoi look astounding.&#60;/p&#62;
&#60;p&#62;Anyone have any ideas?
&#60;/p&#62;</description>
</item>
<item>
<title>Collecting Data Beer (Survey!)</title>
<link>http://forums.flowingdata.com/topic/collecting-data-beer-survey#post-2162</link>
<pubDate>Thu, 07 Apr 2011 07:24:05 +0000</pubDate>
<dc:creator>Anon E Moose</dc:creator>
<guid isPermaLink="false">2162@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Hello, I really love this website and thought everyone might lend a hand and help me collect some data! I am doing some marketing research for school and would like you to complete a quick survey online:&#60;br /&#62;
&#60;a href=&#34;http://bit.ly/frZe57&#34; rel=&#34;nofollow&#34;&#62;http://bit.ly/frZe57&#60;/a&#62;&#60;/p&#62;
&#60;p&#62;There is an option at the end to sign up for any free beer tastings in (if you're in the Atlanta area, and you must be 21 or over and have a valid ID).  If you would like to remain anonymous don't put your email address at the end.  &#60;/p&#62;
&#60;p&#62;Thanks, your opinion would be much appreciated and you could get information about a future brew night or tasting in your area!
&#60;/p&#62;</description>
</item>
<item>
<title>Parsons Journal for Information Mapping Call for Papers and Projects</title>
<link>http://forums.flowingdata.com/topic/parsons-journal-for-information-mapping-call-for-papers-and-projects-1#post-2160</link>
<pubDate>Wed, 06 Apr 2011 13:18:48 +0000</pubDate>
<dc:creator>Katie4PJIM</dc:creator>
<guid isPermaLink="false">2160@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;PJIM – Launching our new, interactive platform – submit your abstract today&#60;/p&#62;
&#60;p&#62;Abstracts due: May 16th 2011&#60;br /&#62;
Full submissions due: June 20th 2011 &#60;/p&#62;
&#60;p&#62;Already in our third year, The Parsons Journal for Information Mapping (PJIM) has published ground-breaking works from a truly international set of contributors. In every article and issue our contributors present their research, explorations, and implementations of innovative information design. Our contributors form an elite group of experts from a wide range of professional disciplines and academic institutions.&#60;/p&#62;
&#60;p&#62;With this coming issue – published April 18th – we are launching our brand new, interactive publishing platform. The platform is an archive of all past and present issues, allows our contributors and readers to interact online, supports advanced searching and filtering, and provides an array of options to read and share every article we publish. Our new web portal is designed to support our contributors' needs to publicize their work, gain international recognition, and interact and share ideas with our broad subscriber base.&#60;/p&#62;
&#60;p&#62;We are currently accepting new abstract submissions for our upcoming issue and looking for works focused on information design, data/knowledge visualization, taxonomies, data analytics, informatics, information systems, and graphic user interface design.&#60;/p&#62;
&#60;p&#62;If you are looking to leverage PJIM's interactive publishing platform and publicize your ground-breaking work to an international audience please submit your abstract today.&#60;/p&#62;
&#60;p&#62;To submit your work, please visit &#60;a href=&#34;http://piim.newschool.edu/journal/submissions&#34; rel=&#34;nofollow&#34;&#62;http://piim.newschool.edu/journal/submissions&#60;/a&#62;.&#60;/p&#62;
&#60;p&#62;PJIM is a quarterly publication of The Parsons Institute for Information Mapping – a research, development, and profession services facility in New York, NY. For more information please visit us at &#60;a href=&#34;http://piim.newschool.edu&#34; rel=&#34;nofollow&#34;&#62;http://piim.newschool.edu&#60;/a&#62;
&#60;/p&#62;</description>
</item>
<item>
<title>Discuss: Possible Shutdown of Data.gov and 7 other US Gov. open data sites</title>
<link>http://forums.flowingdata.com/topic/discuss-possible-shutdown-of-datagov-and-7-other-us-gov-open-data-sites#post-2154</link>
<pubDate>Tue, 05 Apr 2011 12:14:13 +0000</pubDate>
<dc:creator>monserrate</dc:creator>
<guid isPermaLink="false">2154@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;I just read Nathan's post on Guardian Datablog and thought I might try to gather the intelligent discussions around the web and see what people here would like to add:&#60;/p&#62;
&#60;p&#62;&#60;strong&#62;Discussions and Reporting on Data.gov Shutdown on web:&#60;/strong&#62;&#60;/p&#62;
&#60;ul&#62;
&#60;li&#62;&#60;a href=&#34;http://www.guardian.co.uk/news/datablog/2011/apr/05/data-gov-crisis-obama&#34; rel=&#34;nofollow&#34;&#62;Guardian Datablog&#60;/a&#62;&#60;/li&#62;
&#60;li&#62;&#60;a href=&#34;http://news.ycombinator.com/item?id=2393875&#34; rel=&#34;nofollow&#34;&#62;HackerNews Discussion&#60;/a&#62;&#60;/li&#62;
&#60;li&#62;&#60;a href=&#34;http://www.readwriteweb.com/archives/datagov_7_other_sites_to_shut_down_after_budgets_c.php&#34; rel=&#34;nofollow&#34;&#62;ReadWriteWeb Post&#60;/a&#62;&#60;/li&#62;
&#60;li&#62;&#60;a href=&#34;http://sunlightfoundation.com/savethedata/&#34; rel=&#34;nofollow&#34;&#62;Sunlight Foundations &#34;Save the Data&#34; Petition&#60;/a&#62;&#60;/li&#62;
&#60;li&#62;&#60;a href=&#34;http://gov20.govfresh.com/congress-weighs-deep-cuts-to-funding-for-federal-open-government-data-platforms/&#34; rel=&#34;nofollow&#34;&#62;Alex Howard's Reporting on the topic on GovFresh&#60;/a&#62;&#60;/li&#62;
&#60;/ul&#62;
&#60;p&#62;&#60;strong&#62;Key Points: &#60;/strong&#62;&#60;/p&#62;
&#60;ol&#62;
&#60;li&#62;Budget would potentially be reduced from 37 million to 2 million&#60;/li&#62;
&#60;li&#62;Affected sites are Data.gov, USASpending.gov, Apps.gov, [what are the others?]&#60;/li&#62;
&#60;li&#62;Congress would vote on this cut in the next 4 months [exact date?]&#60;/li&#62;
&#60;/ol&#62;</description>
</item>
<item>
<title>How do I break into statistics, machine learning (next steps in education)</title>
<link>http://forums.flowingdata.com/topic/how-do-i-break-into-statistics-machine-learning-next-steps-in-education#post-1862</link>
<pubDate>Tue, 24 Aug 2010 23:09:02 +0000</pubDate>
<dc:creator>sobral</dc:creator>
<guid isPermaLink="false">1862@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Hi everyone,&#60;/p&#62;
&#60;p&#62;I recently finished my bachelor's degree in computer science at Berkeley. Although it may be a bit late, I am just now getting interested in learning more about statistics, machine learning, and more math. Unfortunately, I don't have much of a math background (only took up to Linear Algebra) and the required probability/discrete math course for CS. &#60;/p&#62;
&#60;p&#62;I have already started working full time in a field that won't push me to learn about this area (work is mainly systems stuff). However, I am considering pursuing a MS in CS down the line because I do miss the academic environment. Probably not in the next year though. &#60;/p&#62;
&#60;p&#62;In the meantime, I would like to expose myself and learn about statistics, machine learning, and the required math it takes to understand these fields. Also, I would like to learn more about information visualization.&#60;/p&#62;
&#60;p&#62;I have some questions about all of this...&#60;/p&#62;
&#60;p&#62;1) What are some good sources to get myself exposed to statistics, ML, required Math? I understand that there are many different areas, but a general guideline would be helpful.&#60;/p&#62;
&#60;p&#62;2) Will an MS in CS with a concentration in AI/Machine Learning be enough of a background to work in &#34;data science&#34; as a career? Or will a PhD be preferred?&#60;/p&#62;
&#60;p&#62;I have limited research experience, but I already miss the academic setting of university.&#60;/p&#62;
&#60;p&#62;3) How much artistic talent does it take to perform well in information visualization. I wouldn't consider myself a very artsy person. Sorry if this seems like a dumb question.&#60;/p&#62;
&#60;p&#62;Any advice/help would be greatly appreciated.&#60;/p&#62;
&#60;p&#62;Thanks!
&#60;/p&#62;</description>
</item>
<item>
<title>Serving by Weight</title>
<link>http://forums.flowingdata.com/topic/serving-by-weight#post-2119</link>
<pubDate>Tue, 01 Mar 2011 10:11:02 +0000</pubDate>
<dc:creator>imrancs</dc:creator>
<guid isPermaLink="false">2119@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;For example I have this data:&#60;/p&#62;
&#60;p&#62;headings = {&#60;br /&#62;
         :heading1 =&#38;gt; { :weight =&#38;gt; 60, :show_count =&#38;gt; 0}&#60;br /&#62;
         :heading2 =&#38;gt; { :weight =&#38;gt; 10, :show_count =&#38;gt; 0}&#60;br /&#62;
         :heading3 =&#38;gt; { :weight =&#38;gt; 20, :show_count =&#38;gt; 0}&#60;br /&#62;
         :heading4 =&#38;gt; { :weight =&#38;gt; 10, :show_count =&#38;gt; 0}&#60;br /&#62;
       }&#60;br /&#62;
total_views = 0&#60;/p&#62;
&#60;p&#62;Now I want to serve each heading based on their weightages. For instance, for first 10 requests/iterations, heading1, heading3, heading2 and heading4 would be served 6, 2, 1, and 1 times respectively in order (by weight).&#60;/p&#62;
&#60;p&#62;For every iteration show_count of served heading will increment by one and total_views will also increment globally.&#60;/p&#62;
&#60;p&#62;Could you please suggest an algorithm or some ruby code to handle this?
&#60;/p&#62;</description>
</item>
<item>
<title>O&#039;Reilly&#039; Strata Conference</title>
<link>http://forums.flowingdata.com/topic/oreilly-strata-conference#post-2043</link>
<pubDate>Sun, 16 Jan 2011 06:05:05 +0000</pubDate>
<dc:creator>SmeeZee</dc:creator>
<guid isPermaLink="false">2043@http://forums.flowingdata.com/</guid>
<description>&#60;p&#62;Anyone know if there is an equivalent in the UK? I would dearly love to go, but getting out to Santa Clara is quite an expense on top of the conference cost.&#60;/p&#62;
&#60;p&#62;If there is an equivalent, I would have thought there would be sufficient interest over here (in the UK)? Any thoughts?
&#60;/p&#62;</description>
</item>

</channel>
</rss>

