Home

About
- Who We Are
- Community Guidelines
- Right to Respond

Advertising on BOR
- Advertise on BOR
- Buy on all Texas Blogs

Advertisements

Search




Advanced Search


Wired: Is the Internet Making Scientific Theory Obsolete?


by: Phillip Martin, Progress Texas

Thu Jul 24, 2008 at 02:10 PM CDT


This has very little to do with politics, and a whole lot to do with social behavior, how we think, and how humans are going to process data in the 21st century. I mean -- a rational argument that scientific theories are no longer necessary?

WOW. This was the most interesting thing I've read in months. Read about it below the fold.

ADVERTISEMENT

Wired: The End of Theory: The Data Deluge Makes the Scientific Method Obsolete

I'd recommend everyone read the whole article. Here's how it starts:

Sixty years ago, digital computers made information readable. Twenty years ago, the Internet made it reachable. Ten years ago, the first search engine crawlers made it a single database. Now Google and like-minded companies are sifting through the most measured age in history, treating this massive corpus as a laboratory of the human condition. They are the children of the Petabyte Age. 

The Petabyte Age? Dang. But then I thought about it, and it's fair enough -- I probably "read" 200 news stories a day. I go through all the major dailies in Texas, the major national papers, and then about thirty blogs on the state and national level.

The article then goes on to argue that Google -- and their new philosophy -- is largely responsible for the phenomenon: 

For instance, Google conquered the advertising world with nothing more than applied mathematics. It didn't pretend to know anything about the culture and conventions of advertising — it just assumed that better data, with better analytical tools, would win the day. And Google was right.

Google's founding philosophy is that we don't know why this page is better than that one: If the statistics of incoming links say it is, that's good enough.

And here's the crux of the argument: that because we can now amass such large statistical samples, we can use probability and algorithms to mathematically predict outcomes -- instead of creating theories to scientifically guess what the outcomes will be. Therefore, all we need are strong and large enough computer databases and programs to crunch the data.

The difference is subtle, but critically important.

Learning to use a "computer" of this scale may be challenging. But the opportunity is great: The new availability of huge amounts of data, along with the statistical tools to crunch these numbers, offers a whole new way of understanding the world. Correlation supersedes causation, and science can advance even without coherent models, unified theories, or really any mechanistic explanation at all.

Pretty interesting stuff. What do you think?

Tags: , , , , (All Tags)
Print Friendly View Send As Email
Frightening (0.00 / 0)
With enough data mining, you can make very good predictions about how a complicated system is going to behave. (I strongly recommend fivethirtyeight.com for doing that with politics.) But that's a world away from understanding anything. Or of making good use of what you've understood.

We live in an age where more and more people view "science" as just meaning a bunch of facts, without caring how those facts fit together, or how they're obtained. There's also a backlash against science, with huge numbers of people believing in creationism or astrology.

The Google model, in which everything is treated as a black box, will only make this worse.  


Yeah, that's all well and good (0.00 / 0)
Until the algorithms "prove" something the powers that be consider inconvenient for their cause.

Honestly, I'm no scientist, but I've had it up to my eye teeth with people who don't understand science and the scientific method.

I'm with Lorenzo. Knowledge without understanding, or facts that are too easily discounted are bad enough. Progress by factoid and theroid...is that really progress?


Frightening but exciting (0.00 / 0)
LSadun I agree that it is frightening but it's also exciting because we now can take more of the guesswork out of the actual process. This leads to more accurate analysis of future events. By detecting trends and patterns that have been proven by prior analysis we can hopefully spot the event and take action accordingly.

Of course the issue with this is that by NOT understanding the underlying model we can miss other influential factors that might affect the result set, those that were outside the data collection but within the scope of influence. The example of antidiarrheals listed in Spotting the Hot Zones is a great view on this. Without factoring in the two-for-one sale into the review you would think an e.coli outbreak was in effect. The problem is that the linkages can go on and on and on and ... you get the picture.

I personally think the two work in concert. We'll never get rid of the modeling process. Doing so would preclude other influencers. But putting the two together increases the integrity of results of the models.

Thanks for bringing this to our attention Phillip (200 news stories? Dang boy, take a break sometime :) ). You always seem to provide some brain-busters in your postings.


Chris Anderson has absolutely no clue... (5.00 / 1)
...what theory or modeling is about.  Google or anyone using sophisticated computational and applied mathematics certainly wouldn't make the claim that they were working without coherent models.  There most certainly is a general statistical (or mathematical, or computational) model underlying everything Google does.  Whether or not they take the time to interpret the parameters they estimate from their data and try to infer mechanisms is another question.  My guess is that they do, but whatever.  If they don't, they'll lose to the company that does.

And what does "Correlation supersedes causation" even really mean?  Does this mean Google will soon be waiting (virtually of course) on the checkout pages of commerce sites to post ads to users after they buy something?  If correlation superseded causation, that would mean advertising only after purchasing and charging vendors for the service would be a great business model.

And because Google gets a particular carefully circumscribed answer it is looking for out of data without needing to organize that answer and fit into a larger framework, doesn't mean doing so wouldn't be a great idea that would yield economic (or research) dividends if followed up on.  Nor does it mean that a smaller theoretical framework doesn't underlie those carefully circumscribed goals.

I'm a genome scientist, and people in my field (including myself) frequently use nifty computational tricks and algorithms (neural networks, hidden markov models, markov chain monte carlo methods, evolutionary programming, and other goggledygook-sounding methods) that seem to magically pull answers out of data without much supervision.  And for the particularly well developed methods, the casual genomicist might even be able to get a mediocre publication by finding a few correlations without an attempt at synthesis.  But that's just a testament to the ebb and flow of novelty of ideas (relativity), novel applications (the Manhattan project), and bread and butter technology that simply needs to be repeated (nuclear power plants).

What Google is attempting to do is relatively modest.  Given a window into our curiosity (what you are searching for and clicking on and the same data about a lot of other users) it puts a ton of stuff in front of our eyes in the expectation that tiny proportion of the offers will be clicked on.  Now, computationally, doing that successfully is no mean feat, and Google has marshaled data in an unparalleled way to provide just that in a better way than anyone else and has managed to make profit doing so.  But they'd be fools if they declared victory there and rested on their laurels.  There is no doubt that somebody (or a team of somebodies) is working at Google at this very moment on project that tries to synthesize their models and data in a larger framework that can make more specific predictions about the desires of their users.  Right now a search engine can't even answer a basic question like "Who appointed the first chief justice of the supreme court of the United States?"  What they can do is spit back a lot of results, the first few of which will contain the answer if you choose to read them further...

No, Google is doing applied science at the very least (and is almost certainly doing its fair share of basic research too).  Chris simply doesn't understand what science is.


Wow (0.00 / 0)
Great response...thanks for the comment, gives me a good amount to think about.

I love this conversation.

Now, a very great man once said that some people rob you with a fountain pen.


[ Parent ]
Yeah, I love this topic too (0.00 / 0)
That was a good post.  I re-read my response and it looks like a rant!  :-o  I guess that's just my style.

I'm not sure if Chris Anderson intentionally phrased it that way just to provoke a response, or if he really doesn't know what's going on.  However, even a strawman like that gets the old juices flowing and provokes and interesting discussion.  I love this stuff too.

One thing he does get right...  What we can do with computation and data now is astounding and nearly magical.  I've written some of those programs on a much smaller scale and I understand them intimately, and I'm still amazed at what they spit out to me!  ( I do try to fit the results into larger theory, too...  :P )


[ Parent ]
Ooops (0.00 / 0)
I meant your post was a good one.  Not mine.  I'm tired.  I'm about to go to bed.  I'm on the other side of the world and it is late late...

[ Parent ]
May be a rant but plenty to think on (0.00 / 0)
I enjoyed your comment/rant and still need to chew on it. As I stated in my comment I think there's a balance between the two. With the computational power now available through cloud computing there are some things that can be done through brute force that were not possible before.  However you still need the models to know how to scope and confine the problem.  Otherwise you attempt to boil the ocean.

I think Chris is trying to provoke a conversation using some examples as conversation starters. Apparently he was successful as this is increasing the chatter on the subject. With regards to behavioral aspects such as voting and consumption I wonder if this is sort of a "wisdom of crowds" approach to the problem.  


[ Parent ]
Connect With BOR
Your source for Texas politics.

On Facebook: BOR
On Twitter: @BOR
On the Go: Mobile App

Upcoming BOR Events

"Do I Look Illegal?"
Arizona GOP Debate Watch

Wednesday, February 22
6:00-9:00 p.m.
Angie's Restaurant
1307 E. 7th Street
RSVP on Facebook

Save The Date:
Super Tuesday Super Watch Party!
Tuesday, March 6
6:00-10:00 p.m.
Scholz Garten
1607 San Jacinto



Menu

Make a New Account

Username:

Password:



Forget your username or password?


Shared On Facebook

Advertisement

Best of Texas Left
- (Complete Directory)
- B & B
- Bay Area Houston
- Blue Bloggin
- Bluedaze
- Brains and Eggs
- Capitol Annex
- Collin County Democrats
- Collin County Observer
- Community Forum
- Dog Canyon
- Dos Centavos
- Easter Lemming Liberal
- Eye on Williamson County
- Feet to the Fire
- Grading Texas
- Greg's Opinion
- Grits for Breakfast
- Half Empty
- Houtopia
- In the Pink Texas
- Kiss My Big Blue Butt
- Letters from Texas
- McBlogger
- Mean Rachel
- Musings
- North Texas Liberal
- Off the Kuff
- Panhandle Truth Squad
- Para Justicia y Libertad!
- Pink Dome
- San Antonio Mayor
- South Texas Chisme
- StoudDemBlog
- Texas Clover Leaf
- Texas Kaos
- The Caucus Blog
- There..Already
- Three Wise Men
Best of Texas Right
- Blogs of War
- BlogHouston
- Boots and Sabers
- Lone Star Times
- Publius TX
- Rick Perry vs the World
- Safety for Dummies
- Slightly Rough
- Urban Grounds
Other Texas Reads
- Burka Blog
- D Magazine
- DOT Show
- Statesman Elections
- Strong Political Analysis
- Texas Monthly
- Texas Observer
- The Texas Blue
- Quorum Report Daily Buzz
Around Austin
- Austin Bloggers
- Austin Chronicle
- Austin Contrarian
- Austin Metblogs
- Austin on Two Wheels
- Austin Real Estate Blog
- Austin Statesman
- Austin Texas Bike Shit Stuff
- Austin Towers
- Austinist
- Capital MetroBlog
- Daily Texan
- Do512
- Downtown Austin Blog
- East Austinite
- Elise Hu
-
Flash Mob Austin
- Keep Austin Blue
- M1EK
- Travis County Democrats
- University Democrats
TX Progressive Orgs
- ACLU Legislative Blog
- Atticus Circle
- Criminal Justice Coalition
- Equality Texas
- NOW Texas
- PFAW Texas
- Public Citizen
- SEIU Texas
- Tejano Insider
- Texas AFT
- Texas HDCC
- Texas Watch
- TFN
- TSTA
- TSEU
- Texas Young Democrats
- United Ways of Texas
TX Elections/Returns
- TX Returns 1992-present
- TX Media/Candidate List

- Bexar County
- Collin County
- Dallas county
- Denton County
- El Paso County
- Fort Bend County
- Harris County
- Jefferson County
- Tarrant County
- Travis County

- CNN 1998 Returns
- CNN 2000 Returns
- CNN 2002 Returns
- CNN 2004 Returns
- CNN 2006 Returns
- CNN 2008 Returns
Traffic Ratings
- Alexa Rating
- Quantcast Ratings
-
Syndication

Powered by: SoapBlox