Thursday, March 29, 2012

Answer: How long is it?

Image of Ball's Pyramid from Wikimedia.

Yesterday's question had two parts: 
1.  What was the name of the ship that nearly caused the extinction of this amazing insect?  (For extra credit, why would a ship running aground have such a dire consequence?)  

2.  How long is the body of the insect when it first hatches from the egg?  (This is also totally surprising.)  


Most people found part 1 to be pretty straightforward.  A query like [large insect island] quickly leads to the NPR story by Robert Krulwich about the Lord Howe's Walking Stick  that ran just a few weeks ago... unfortunately for me as that article (which is a great story!) made this search a bit TOO simple.  


From the Krulwich report that you can read the story of how the S. S. Makambo ran aground Lord Howe's island in 1918.  If you do a few searches to second source his story, you can find lots of supporting evidence.  (It's not that I don't trust Robert Krulwich, it's just that I check almost everything.)  Rats from the ship went ashore on Lord Howe's island and wiped out the insects pretty quickly.  A sadly common story throughout much of the South Pacific islands--invading species destroys endemic animals and plants.

The second question, though, is a bit more difficult.  

Several readers did a smart thing and used the scientific name of the insect to search for more biological information.  This is a good strategy to remember: When you're searching for detailed scientific data, you usually want to use the binomial version of the name (that is, the pattern is Genus species, the scientific Latinate version of the animal's name).  In this case, it's Dryococelus australis, the Lord Howe's Walking Stick. 

So a quick search for [Dryococelus australis nymph size] (or [... hatchling size]) quickly leads you to the remarkable (and very unexpected) Husbandry Manual for the Lord Howe Island Stick Insect issued by Australasian Society for Zookeeping.  Yes, there are a group of people who have collected all known information about how to raise and keep these insects happily in captivity.  

In that text, the following sentence appears (this has got to be one of my favorite sentences in all the scientific literature):  "LHISI nymphs measure approximately 20mm on hatching, varying between 16-22mm. They emerge from the egg usually underground, and burrow to the surface to climb the nearest large object. If kept in Chinese takeaway containers they will always be found clinging to the underside of the lid."  

And that would seem to be that.  

Except that I (like many of you) also watched the video of the LHISI emerging from its egg case.  And when the operculum on the egg popped open, an amazing large insect emerged.  (Thanks to Rohan Cleave at the Melbourne Zoo for posting that video.)  

I also looked around  http://www.flickr.com/photos/hornets_inverts/6375766233/in/photostream/ and since there's a hand in the photo, I can estimate size across the opening hatch at 0.15 inches.  (I later found that the Husbandry Manual tells us that the operculum is 3.6mm wide, which is pretty much the same thing.)  


 I grabbed one of the final frames from the video and measured the insect in terms of "operculum widths."  Each little yellow line in the second image is around 1 operculum width (as estimated from the same image) and then drawn atop the newly hatched nymph.  As you can see, the body is 8 operculums (opercula?) long, and 15 operculums from tip to toe.  If each operculum is 3.6mm, that means the body is 8 * 3.6mm = 28.8 mm (1.1 inches) and overall it is 15 * 3.6mm = 54mm (2.1 inches).  

No matter how you look at it, that's one big insect out of an egg that's the "size and shape of a TicTac."

So... how did you do? 


The average time to solve this challenge was 13.0 minutes; the median time was 10.0 minutes.


But what's always interesting to me (in addition to the amount of time it takes) are the answers we see in the comments.  Here's the distribution of answers given by the commentors.  You can see that we could probably check our work a bit better!   
There seems to be three sources of errors here. 
 (1) Assuming the nymph is the same size as the egg (clearly NOT true if you look at the video);
 (2) misreading the adult length as the nymph length (that's where the estimates of 65 and 75 mm come from);
 (3) eyeballing a length estimate off the image from the video.

(length estimates in mm)




 
==============
Additional reading:  Gurney, A.B. 1947. Notes on some remarkable Australasian walkingsticks, including a synopsis of the genus Extatosoma (Orthoptera: Phasmatidae). Annals of the Entomological Society of America 40(3): 373-396.

Wednesday, March 28, 2012

Wednesday search challenge (3/28/12): How long is it?

Today's challenge is exotic and amazing.  It seems prosaic when I state it, but the search really is worth the trouble.  Trust me on this one.  


A little background info to get you started.  


I'm interested in a particular insect, a kind of walking stick that lives in only one place in the world.  (On the fantastic island pictured below. It's a real place.) Except that wasn't always true.  It used to live on two islands side-by-side somewhere in the Pacific.  Thing is, a ship ran aground on one of the two islands causing this particular insect to become extinct.  Luckily, it went extinct on one of the islands and not the other.  




Why do you care?  Aside from the usual important issues around loss of biodiversity, the insect is extraordinary.  It's huge.  Scarily huge.  Jurassic park huge... and it's still around.  


But today's question is really two-fold.
1.  What was the name of the ship that nearly caused the extinction of this amazing insect?  (For extra credit, why would a ship running aground have such a dire consequence?)  

2.  How long is the body of the insect when it first hatches from the egg?  (This is also totally surprising.)  


As usual, be sure to let us know how you figured out the answers to these questions AND how long it took you! 

Tuesday, March 27, 2012

A do-over on Languages above and below the equator


As several readers pointed out in the comments, my analysis didn’t REALLY answer the question I’d asked.

They were correct.  Thanks for picking up on this. 

I found the languages by COUNTRY, then added up ALL the languages spoken in the country.  That’s not what I asked for!  I wanted the TOTAL number of unique languages spoken in each hemisphere. 

Okay.  Let’s do this again.  Better this time. 

This time I pulled the list of languages from the CIA World Fact book page on languages of the world.

I copied it from the web page and pasted it as plain text into a .TXT file.  This gave me a simple spreadsheet with country names in the left column and columns of languages to the right. 
From CIA Fact Book (see link above)
HOWEVER… I now had to do some data cleaning.  Many of the language entries had comments (e.g., percentages of speakers of that language in the country) scattered throughout the listing.  They’re useful for readers of the language list, but not helpful for the data analyst.

So I spent about 45 minutes cleaning up the data: removing percentages, deciding which small languages to keep (I mostly didn’t, but just called them all called “other”—sorry speakers of regional dialects). I alsot tried to canonicalize all of the variant spellings of different African languages, but I’m sure I didn’t get them all right.  (So there’s a slight overcount of African languages.) 

After about 1 hour total, I had a spreadsheet of countriesw/ a list of languages.  (I’ve shared these spreadsheets with you so you can see what I did.) 

I turned this into a Fusion Table of countries and languages spoken (so I could fuse it with my  already existing Fusion Table Country-LatLong


Then exported THAT fusion table as a regular spreadsheet so I could easily select the rows that are below the equator (that is, that have a latitude with an “S” in them). 


Now we’re getting someplace!

Next I sorted by latitude, then copied all of the languages BELOW the equator into a simple text file (which I then imported into a spreadsheet) so I could remove duplicates easily with a spreadsheet function.  In Google Spreadsheets, that’s the UNIQUE function.  This step gave me the Master list of Southern Languages

Did the same thing with the languages ABOVE the equator and created a spreadsheet Master List of Northern Languages.  


Once again I used the unique function to identify the unique languages. 

Now I can read off the numbers:  the Southern Hemisphere has 90 unique languages, while the Northern Hemisphere has 230 unique languages. 

Lesson learned:  Doing these kinds of analyses can sometimes be tricky, even when you write the question!

Monday, March 26, 2012

Quick comment: More to come on LANGUAGES below/above the equator

One of the great things about writing a blog like this is that there are so many smarter-than-you people out there!  


More than a few people wrote in to comment that either I'd done the analysis incorrectly (which is kinda true), and that I hadn't posted my spreadsheets in a public place (clearly true).  


A few wrote in to complain that this data point or that data point wasn't right.  And some of them have a point... but I don't think it will matter in the end.  


In any case, I'm travelling right now and wanted to let you know I haven't forgotten this analysis.  I'll try to get to it this evening, righting the wrongs in my spreadsheet and talking about why and how such analyses sometimes goes awry! 


Which leads to my best internet search source credibility heuristic ever: 


If a source (such as a newspaper or web site) never publishes and erratum, you know they don't care much about how right they are.  
Finding admissions and corrections of errors is a HUGE positive sign that the source is credible.  


Forever searching on! 



Friday, March 23, 2012

Answer: Are there more languages ABOVE or BELOW the Equator?


The question was:

     Are there more languages ABOVE the Equator or BELOW the Equator? 

As I said, a big part of answering questions like this is making your terms very clear.  When we decide to consider only “Official” languages, and not, say, every small language spoken in New Guinea (which has thousands) or every language spoken in central Africa (more thousands), we REALLY simplify the problem. 

What this lets us do is search for a list of “Official languages.”  So just doing a search like [official languages] quickly leads to the Wikipedia page of OfficialLanguages.  And that's not a bad place to start.  

But by looking around, I found that search ALSO leads to a languages-by-country table on Infoplease with more-or-less the same data in a somewhat more convenient form.  (Another search that works would have been [ list of languages by country ].) 

So I now have a table from Infoplease that looks like this:

To convert it into a form that we can manipulate, I just copied out the text of the table and saved it into a plain text file.  Handily, the copy/paste into a file automatically inserts TABs between the columns.  This will make it simple to import into a spreadsheet,  

Now I need to figure out which of these countries have their capital in the northern hemisphere.  So I THEN did a search for [ country capitals latlong ] and found another handy table of country capitals with their associated lat-longs! 

I did the copy/paste to get this data from their web page into my plain text file.  I then imported that into another spreadsheet.  

Now I’ve got two spreadsheets, one with a list of languages-by-country, and another with country-capitals and their lat-longs.



From the Lang-by-country spreadsheet I can easily write a spreadsheet function to count the number of languages in each country. 


A typical row looks like this:

    Afghanistan, Dari(Persian), Pashtu (both official),other Turkic and minor languages

The 199 languages of each country in this table are separated by commas.  So, I write a short spreadsheet function to count commas in each row, and—voila!—I have a table of Country / Number-of-languages.  (I’m happy to let the phrase “…other Turkic and minor languages” be counted as 1 extra language.  You'll see it won't matter in the end.)  By this method, I see that Afghanistan has 3 languages.  Yes, there are others, but there are only 2 official languages (and many non-official languages). 


In my other spreadsheet I have a table of 200 Country / Capital / Lat-longs. 

Obviously what I want to do is to use the Lat-longs to determine if the country is ABOVE or BELOW the equator, and then just count the number of languages in each group. 

This is exactly what Google Fusion Tables are designed to do.  

A Fusion Table can take two different tables that share a common element (in this case, the column labeled “Country”) and JOIN them together.  That’s 200 copy/paste operations I won’t have to do by hand. 

I created two Fusion Tables, one for each of my two spreadsheets. Using Fusion Tables "Merge" function, I just joined them together on their shared column "Country."  

 
I opened the Country-LatLong table and then FUSED it with the Country-Language-Count table.  This basically merges the two tables so that I now have new table with columns for Country / Lat-long / Number of languages! 

Once I have this, it’s an easy sort of the table to put all of the countries above the equator (that is, their Lat-long contains an “N” in the entry), and then write a simple =SUM(…) function to add them all up. 

Here’s what my table looks like now.  I converted the presence/absence of an “N” in the lat-long into a 1 (for ABOVE) or 0 (for BELOW).

And now we can just read off the answer: There are 546 official languages spoken above the equator, but only 164 spoken below.  That’s a big difference. 

I’ll leave it to you to speculate why that is.  But at least we’ve got a little data to go on.

And, since I was in Fusion Tables, it was trivial to visualize the data by mapping the number of languages onto the map-of-the-world and create this map.  The darker the green, the more languages spoken there.  As you can see, the north has a LOT more than the south. 



SEARCH LESSON:  Often you’ll find that data is out there on the web, but it’s in an un-handy form.  In this case, the information we wanted wasn’t immediately available.  But with a little data extraction, spreadsheet jockeying and using Fusion Tables to merge two data sets, we can get to what we want fairly quickly. 

Keep your eyes open for these data-merge possibilities.  Once you start looking around, you’ll see all kinds of merging that leads to insights about the world.  

Tuesday, March 20, 2012

Wednesday Search Challenge (3/21/12): Are there more languages above or below the equator?

We were talking the other day about how languages came into being. You know the stories--English grew out of the Anglo-Saxon dialects with dashes of other languages (especially French and Latin) added in.  Here's one view of how languages are scattered around the world: 
Image inspired by a language map from WikiMedia Commons
But then the discussion grew a bit more heated... Did languages arise because of geographic isolation, did they come about because of drift over time, or... what?  


After a few glasses of wine, various complex and learned theories were bandied about, and in the end we boiled it down to one really interesting search problem that we couldn't resolve with a quick Google search:  


     Are there more languages ABOVE the Equator or BELOW the Equator?  


A big part of answering questions like this is making your terms very clear.  For this question, let's consider only  Official languages (that is, languages recognized by the country) and not worry about relative sizes of speakers. We're just interested in whether or not countries below the Equator  have more languages than those above the Equator.  


To take an example, we know Switzerland has four "official" languages (German, French, Italian and Romansch).  Romansch has been recognized as one of four "national languages" by the Swiss Federal Constitution since 1938.


So, once you figure out what the languages of all the world's countries are, and divide them into languages spoken above-equator and below-equator, you'll be set to give the answer.  Notice that some languages (such as French or English) are spoken both above and below the Equator.


And for countries that straddle the Equator, let's go with the location of the capital as deciding if the country is above or below.  Thus, Ecuador (capital: Quito, at 0° 9′ 0″ S, 78° 21′ 0″ W) is in the southern hemisphere.  


Obviously, you don't want to do this one country or one language at a time, so the real question is... HOW will you do this?  


As usual, please let us know your answer, HOW you figured it out, and HOW LONG it took for you to figure this out!  


More above or below?  The ideal solution will actually determine some way of measuring this directly from data available on the web.  


Search on! 

Thursday, March 15, 2012

Answer: Where's that cat with the Mai Tai?

Short answer: This cat is a bronze statue sculpted by Vic ("Trader Vic") Bergeron.  It stands outside of McCone Hall on the UC Berkeley campus at lat/long 37.873763, -122.259545.  "Trader Vic" is the inventor of the Mai Tai rum drink.    (For what it's worth, that lat/long can also be written in old-school style as: 


      +37° 52' 25.55", -122° 15' 34.36"


And now... how to find it:  Here's what I really did...   Given that the picture was pretty high res, I just zoomed in and read the plaque at the bottom of the statue.  Duh.  But in a search challenge, every clue is fair game.  Once you zoom in enough you can read it was sculpted by Vic Bergeron, which seems like a pretty obvious clue.  A quick search for [ Vic Bergeron Mai Tai ] and you'll find many articles (including the Wikipedia entry about Trader Vic's) that argue that he was at least one of the two people who claimed to have invented the Mai Tai (the other person being Donn Beach, of "Don the Beachcomber" fame).  


Several readers noted that the mini-plaza around the statue seemed university-like, and that's a good clue as well.  But for a question like this, I always just start by asking the simplest possible question with a query like [ sabertooth cat statue bronze ] and then switch into Images mode to see what I could spot there.  Sure enough, I found the picture just a few rows down.  From that image I learned it was at Berkeley, so a quick second source search led me to a result from Berkeley.edu, a result that is a tour of McCone Hall, http://seismo.berkeley.edu/seismo/geotour/mccone.html -- which seems pretty substantially credible. It even included the reference to the Mai Tai:  
"...The saber-toothed cat, Smilodon californicus, whose likeness menaces the southern entrance to the building, is the official California State Fossil. It was sculpted by Victor Bergeron (of Trader Vic's fame) who is more widely known around the globe for his Mai Tai than for his sculptures..."  


I have to admit to being surprised by a few of the answers submitted by readers. Who knew, for instance, that doing a query for [Sabre Toothed Cat, University Drive, Berkeley, CA] in Google Maps would work?  Nice discovery.  


Using the lat/long from above, these are the best aerial images I could find.  





BTW, you CAN extract the lat/long from the link, but it's much better if you install the lat/long tool in Maps, then just right-click on any point in the map to capture the exact lat/long location.  


Most people reported finding this one in less than 10 minutes.  Next week, we'll go back to harder questions!  Get ready!  


Searching on! 

Wednesday, March 14, 2012

Wednesday search challenge (3/14/12): Where's that cat with the Mai Tai?

I was wandering around the Bay area and came across a remarkable statue.



Even more amazingly, once I saw it, I realized that this bronze cat has a deep connection to that rum concoction known as the Mai Tai.  


Given just the photos shown here, you should be able to figure out what the connection is. Can you?  What possible connection can there be between this statue and a rum drink? 


For extra credit, you should be able to easily determine the lat/long of the statue.  Can you find an aerial image of the cat?


Search on! 


(For all you advanced searchers, note that I've altered the image just enough so that "Search By Image" won't work here.  I thought I should tell you so you won't waste your time...) 

Thursday, March 8, 2012

Answer: Fishing over your head?

Short answer:  The submarine H-5 (USS-148) was at least occasionally used for fishing by submariners who would attach fishing line, hooks, bait, and floats to the top deck.  As the submarine trolled underwater, the fishing gear would do its work and the sailors would collect their catch upon resurfacing.  In this way, all of the fishing tackle was ABOVE the sailors, as were the fish themselves!  


And surprisingly enough, the  submarine was originally built for the Russian government!  But she was never delivered, and spent the rest of its days operating out of the San Pedro (California) sub base.






How to search for this:  I knew this was out there somewhere, and tried a bunch of combinations of searches [ overhead fishing ] then [ upside-down fishing ] and similar kinds of queries, ultimately leading nowhere.  I probably spent about 5 minutes trying out different combinations, getting nowhere.  


I then sat back in my chair and thought about the situation, trying to imagine what it might look like.  What set of circumstances could possibly lead to fish being overhead?  


Then I had an insight that was crucial.  Maybe the fishermen were in a submarine!  It sounded crazy to me, but I did the quick search [ submarine fishing ]  and discovered at the 9th result place a link to Fishing History... and fishing from subs


To tell the truth, this looked a bit too much like a bit of fake news.  (There are a number of people who fake old news stories to look just like this.)  So I really wanted to check the original source.  How could I do that? 


The article claimed it was from Popular Mechanics in the 1920s… so I just popped [submarine fishing Popular Mechanics 1920] into my search box and quickly found that Google Books HAS Popular Mechanics (but the article itself wasn’t showing up on the first couple of pages).  


So I went to Google Books and repeated my query. [Popular Mechanics submarine fishing 1920] Now that my query is scoped (that is, just limited to) Google Books (and therefore, Google Magazines as well), I found the original story by John Edwin Hogg from Popular Mechanics magazine, volume 34, 1920, p 591-592.  


Generally speaking, if you know a ship’s name you can usually find out its history.  (It also turns out there are a lot of sailing historians that build up ample documentation on the web.)  


So a quick search for [ submarine H-5 history ] quickly leads to a set of results about submarines named H-5.  That’s not unexpected, but I didn’t know there was ALSO a British submarine named H-5 as well.  (Be careful to not confuse the HMS H-5 with the US ship H-5, aka USS-148. The British sub sank while the USS H-5 was decommissioned and sold for scrap in 1933.)  


The Wikipedia article on the H-5 gets right to the point, and incorporates text from the History.Navy.mil web site, which is presumably the most authoritative source you can get on US naval vessels.  


The H-5 was built for the Russian Imperial navy during the First World War in 1915 at the Electric Boat company in Groton, CT (now known at the General Dynamics Electric Boat Company), but H-5 was never delivered due to the Russian Revolution of 1917. 


But now I was curious and I wanted to see what the submarine looked like, so I did a quick Google Image search for [ H-5 submarine







Amazingly, I found quite a number of photos of the H-5 on the Navy.mil site, including interior shots showing the torpedo tubes and the “fish” the H-5 carried.  Although the H-5 had “fish” (torpedos) onboard, that wasn’t the fish I was seeking.


And, if you visit the website, you’ll see that the photographs were taken by none other than John Edwin Hogg, the author of the Popular Mechanics article!  It seems that Mr. Hogg was an enterprising writer.  At least he did his research!  


Search lessons:  This week, the lesson is that you sometimes have to step back and re-think what it is you’re searching for.  Once I got the idea that the only way to fish OVERHEAD was from a submarine, I added that term to my search, and everything immediately fell into place.  As crazy as it sounds, yes, anglers can fish from subs.  Once you add that key term into the search, everything works. 


Search on! 









Wednesday, March 7, 2012

Wednesday Search Challenge (3/7/12): Fishing over your head?




One of the most remarkable stories in the annals of fishing is the tale of one particular boat that let anglers fish in a way that was completely unexpected.  While the fishermen relaxed or did other chores, their hooks, lines and floats would be well over their heads.  I don’t mean this metaphorically, I mean it literally—the fishing gear would be many tens of feet above them... and the fish as well!  

As remarkable as that is, the real question for today is this:  
There was one particular boat that became best known for this style of odd “fishing above the head.”  Can you find the name of that boat AND who the boat was originally built for

As usual, be sure to say what queries you used to find the answer and tell us how long it took you to discover the story behind "fishing over your head"! 

Search on! 

Monday, March 5, 2012

We must go deeper: Why search is always going to be tricky


In the movie “Inception,” the main character keeps goading his team farther into the subconscious mind with “we must go deeper…”  Their goal is to get to the bottom of the many layers of experience that makes up the human mind in order to plant a single idea. Each layer depends on the ones below it; they pile up to make an image of perceived reality.  And that’s kind of what I’m thinking here…

I often have this feeling of “we must go deeper” when I’m doing a simple search for information on Google—and yet I’m surprised when I see that other people don’t seem to have that same urge to understand what’s really going on, but are satisfied with the first answer that pops to the top. 

Somewhere along the way I must have developed a fact-checker’s outlook.  You know who the fact-checkers are: they’re the woefully underpaid people who read news, books, and magazine articles pre-press and meticulously verify that what’s written actually lines up with observable reality.  They’re paid to not just lookup the original publication of a fact, but then to also follow-up with the next few issues to see if any retraction or correction was published.  That’s what a real fact-checker does.  Running down truth, even if it takes several extra steps.

The simple reality is that looking stuff up has always been difficult.  Now, it’s just difficult in a different way (so many alternatives to consider, so many ways to think about things, so many ways to measure).  A fact-checker looks at a sentence like “There were 31 bars in Elko, Nevada in 1965” and immediately asks the question “How would I figure this out?  Would I count liquor licenses issued in the city of Elko during ALL of 1965, or just active licenses on 12/31/65?” 

When I talk to students as they search for simple facts, I find that they just want to get an answer that’s plausibly correct.  The actual truth of a fact doesn’t seem to matter as much as it confirms something previously known, or is merely close to something plausible.  It’s as though they live in an ongoing game of horseshoes where getting your token close to the post is often good enough to win.  

The thing that worries me these days is that not only do students not want to second source a reference (let alone look for retractions or corrections), but they often don’t want to understand the topic in enough detail to know how-and-why their fact might be incorrect.

The simplest example I can think of is the straight-forward question “What’s the circumference of the earth?”  How hard could finding that be?

 It IS the case that if you do a quick Google search, you can quickly end up with a very credible looking answer.  Here’s one answer: 24,901.55 measured around the equator (red arrow).  But here’s another:  24,859.82 miles if you measure around the poles (yellow arrow).  And of course, you can get different sets of number depending on who made the measurement, how it was made, and according to what model of the earth they’re using.

Now wait a second!  How does “model of the earth” enter into this discussion?  Isn’t the circumference an actual physical measurement? 

Well, yes, it is… but this is where you have to go deeper and realize that nobody actually takes the measurement.  Think about it for a second:  Walking around the equator with a yardstick just doesn’t work for all kinds of practical reasons.  So to actually measure the circumference, you’d actually measure something that’s easier to do—say, the difference between two vertical angles measured at two points on the surface.  (That is, if you project a vertical ray up from a point and see what star it hits, then walk north 100 miles and project a second vertical ray upwards, you can measure the difference in the angles and figure out what angle is subtended.  Once you know the diameter from other means, with this measured angle you can work out the circumference of the earth.) 

But here’s the tricky part:  If you measure some fraction of the earth’s surface, you then drop that into a MODEL of the earth to compute the circumference.   If you assume the Earth is a sphere, then it’s just simple geometry.  But  the truth is, you have to know that the earth isn’t a sphere.  (I assume you know that!)  But if it’s not a sphere, then what IS it?  That’s where the model comes in. 

Newton thought the earth was an oblate spheroid (a sphere that’s been squashed down a bit) with a flattening of 1:230 from truly spherical.  He derived that ratio from his model of gravity, and brilliant though it was to figure that out, it’s not quite right.  Measurements later showed that the flattening is more like 1:210.

Except… we must go deeper

The model of Earth as oblate spheroid assumes that it’s a “body of revolution.”  That doesn’t mean it suffers from constant political uprisings, but means the model assumes it’s pretty smooth, as though it were turned on a lathe.  Of course, you know by now that nothing is that simple.  Earth is actually fairly lumpy, with bumps on it—think Rocky Mountains or Himalayas—or low spots like Death Valley.  So a better model is complex ellipsoid with various undulations that make it NOT a body of revolution. 

The real question is probably this: Why should I care?  The two numbers given above (24,901.55/ 24,859.82) are within 1% of each other.  Does that matter? 

If you’re launching a spacecraft, it matters.  But the real point is that when searching for something as straightforward as the “earth circumference,” there’s a backstory about the number that reveals a good deal.   (And personally, the backstory is often more interesting than the final number… but I digress.) 

Thing is, this is true for most things you look up.  Suppose someone asks “what’s the biggest city on Earth?”  A quick Google check shows that it’s Shanghai with 17,836,133 people.  Really?  Since the time you started reading this, some people died in Shanghai while others were born.  What does the instantaneous population of a city even mean? 

And so checking the backstory quickly leads to clarifying questions: “What do you mean by biggest?”  Are you measuring population or area?  Do you mean city boundary as determined by the city council or by measuring the urban core and excluding suburban regions? 

This is, of course, what reference librarians do—they conduct the “reference interview” to try and pin down the variable parts of someone’s question. 

When I was doing a field study at the University of Alaska (Anchorage),  I actually played the part of reference librarian for a couple of hours while the snow fell and it grew dark at 3PM.  Mostly the questions were “where’s the bathroom” (no backstory needed for that!) and “how do I get my printer to work?” 

But then a graduate student asked for a “list of the top ten journals in Sociology.”  He wanted to get a general awareness of what the field was about and what would be reasonable topics of current interest. 

Getting a list of sociology journals isn’t hard.  Rank ordering them according to some reasonable function is tough.  “What would you like?”  I asked.  “Readership size?  Citation rates?  Agreed-upon influence?  Most-often-checked-out?”  Those all produce somewhat different sort orders.

Thing was, he didn’t really care.  He didn’t need an ordered list—just ANY list of 10 good journals would be fine, thanks very much.

I’d just assumed that he wanted an ordered list.  That’s what working at Google does to you; I see the world as a ranking problem and just assumed he wanted a list with an evaluation.  But since he was going to spend as much time with journal #1 as with journal #10, it really didn’t matter. 

Any good psychologist will tell you that “the presenting problem is often not the root problem.”  Likewise, the first question is often not the real question.  Good searchers take a problem and worry on it a little.  Good searchers look behind the “answer” to understand how that answer came to be.  In the circumference case, it was the measurement method.  In the population problem, it was how “largest” was defined and then how that property was measured as well. 

If I could somehow give everyone a piece of search advice, I’d say “think like a fact-checker” and get one or two steps deeper into your topic matter.  You must go deeper in order to understand what it is you think you’ve found.  The fact you seek isn’t just a fact, but something based on a set of choices about how to measure and what to report. 

Of course, I can say this all I want.  But this approach hasn’t worked well in the past.  (People have been saying this for a long, long time.)  How can I help people see this is what they really want to do when they really want to understand a topic? 

Ideas? 

Thursday, March 1, 2012

Answer: Impossible, yet simple?


Short answer to the challenge... The Egg of Columbus

(But read on for the rest of the story!) 

This search challenge was harder than I thought… but for an interesting reason.  I’ll take the blame here—the question (“English 4-word idiomatic expression for something thought to be impossible, but shown to be trivial”) is not exactly a cleanly defined, clearly answerable kind of question! 

As a consequence, people found all KINDS of answers that I hadn’t thought about. 

   Forest for the trees
   Cutting the Gordian knot
   Thinking outside the box
   Keep it simple stupid
   Hiding in plain sight
   To square the circle
   If it were a snake it would have bit me  (so close… but too long!)
   Elephant in the room
   Needle in a haystack

And even some unusual ones:

   Riddle of the Sphinx
   L'spirit de esclair

Alas, while many are close, they’re not exactly right.  “Cutting the Gordian knot” has the right sense of a sudden rapid solution to an unsolvable problem. 

Thing is, I hadn’t thought of that phrase as a possible solution.  So kudos to those of you who made all of these suggestions.  I’d put “Cutting the Gordian knot” as a correct answer, with  “Forest for the trees” and “thinking outside the box” as decent half-credit answers.  (They don’t saying anything about the solution, just the path to the clever solution.) 

As you can see, writing a nice, hard search problem that has a neat, instantly recognizable solution is tricky. 

Out of the roughly 20,000 who read this problem, only 3 found the answer I planned:  JacobM, Johns1111 and Amarek all found what I’d INTENDED which is such problem/solutions are called “The Egg of Columbus”--as in the sentence "That's a real 'Egg of Columbus' type problem.'  

JacobM solved the problem by searching for  [ seems | looks impossible ]  (where the vetical bar symbol means OR).  He found some idiom collections and flipped through them until finding the right one.  This is a really good approach!  

In the comments, Johns1111 points out that in German this would be “das Ei des Kolumbus,” which is a transliteration of “The Egg of Columbus,” and, in my opinion, quite correct.  Congrats also to Amarek for also finding  the same solution. 

The Back Story to this problem:  This problem was originally given to me by a friend who knew that such a phrase existed, but he couldn’t think of what it was.  If it makes you feel any better, I worked on-and-off on this question for about 4 weeks, trying new ways to search for it… basically looking for a way to rephrase the question so that the results would contain the answer. 

After trying lots of queries (example:  [ phrase simple solution hard problem ] or [ idiom for sudden answer discovery ] )  I finally tried the query:  [ discovery easy "after the fact" ]

My reasoning was that someone writing about this idiom would have to use the phrase after the fact and use that phrase in the context of discovering  an easy  solution.  That's how I chose my search terms.  

Luckily, it worked out well.   But truthfully, only after a lot of systematic search on my part.  


The Wikipedia entry for "Egg of Columbus"  has the following explanation for the phrase which I like. 

The Egg of Columbus story might have begun with the Italian historian Girolamo Benzoni. In his book History of the New World (1565) he wrote: Columbus was dining with many Spanish nobles when one of them said: 'Sir Christopher, even if your lordship had not discovered the Indies, there would have been, here in Spain which is a country abundant with great men knowledgeable in cosmography and literature, one who would have started a similar adventure with the same result.' Columbus did not respond to these words but asked for a whole egg to be brought to him. He placed it on the table and said: 'My lords, I will lay a wager with any of you that you are unable to make this egg stand on its end like I will do without any kind of help or aid.' They all tried without success and when the egg returned to Columbus, he tapped it gently on the table breaking it slightly and, with this, the egg stood on its end. All those present were confounded and understood what he meant: that once the feat has been done, anyone knows how to do it. 
Image of Columbus breaking the egg, by William Hogarth.
Original in National Portrait Gallery, London.  This image from Wikipedia.

Whether or not the story is true, it has entered into the world as an idiomatic expression for "seemingly impossible to solve, but actually trivial."  An excellent idiom. 

Search lessons(1) First, as should be obvious, finding idioms is hard.  People who searched in idiom dictionaries were on to something.  That’s a great strategy for searching out this kind of information.  I tried a couple before giving up, hoping to stumble across it (which is what happened).   

(2) You sometimes have to search for the right query.  I tried a lot of queries in my search (at least 100 over the 4 week period) before I found the right combination of search terms that led me to the answer.  I was really interested (which is why I dedicated so much time to the problem). 

My big tip about long searches is pretty simple:  Keep track of what you’ve searched for and found.  That’s especially true for long search processes that take weeks.  I just kept an open text file with my “search notes” in it.  It really prevented me from duplicating my searches and let me systematically eliminate possibilities.  

Search on!  (But systematically...)