Words & Numbers

Epilogue

2018-02-07T21:35:00.000-08:00

This blog was active from 2005 to 2012. If you’ve found it long after, the best place to start is Best Of.

If you are looking for a work update beyond the last post below, 2012’s “At Responsys,” please check my LinkedIn profile.

Thanks for visiting—I hope you enjoy the words and numbers herein!

At Responsys

2012-09-11T21:10:00.001-07:00

An update from the professional front: I have joined Responsys as SVP Product Management.

Responsys is a software-as-service provider for interactive marketing: It helps companies communicate with customers via interactive channels like email, web, social, mobile, and display ads. As people increasingly spend time online, these channels are where marketers want to be. Plus, compared to traditional channels like television or print, in which you get the same message as everyone else, interactive channels promise to be more targeted and personalized—so you get what’s relevant for you.

Having been in the interactive marketing field since the beginning, I believe in this promise. It is good for companies and customers alike. The challenge is to make it real. Responsys is a leader in doing so, having gone public in 2011 (ticker symbol MKTG) after growing straight through the late 2000s recession.

So, I’m excited to help grow something that already has substantial scale, in a market that keeps renewing itself with new channels and technologies. If you want to join me, we have many opportunities across the company for like-minded people.

Back in the Bay Area

2012-09-03T21:34:00.000-07:00

I knew I was back in the Bay Area when:

On the second day, exiting Highway 92, I was behind a Google self-driving car.
My new town’s waste-collection system gave me three cans: a big one for recycling, a big one for composting, and a small one for garbage.
I was walking along a trail, wearing an E*Trade hat I randomly acquired in the past. A guy walking the other way asked, “Do you know what E*Trade closed at today?”

Yes, after four-plus years in West Hartford, Connecticut, we–my wife, daughter, and I–are back. As I and others have said before, West Hartford is great. But for us, the Bay Area is home.

Omnivark’s Time and Place

2012-07-26T06:46:00.000-07:00

Today saw the final edition of Omnivark, my personal project to teach a computer to identify great writing. Each day, Omnivark would pick three pieces of new, nonfiction writing on the Web, plus a book. I’m proud of Omnivark’s quality during its six-month run. (Feel free to click a random day from the archive, and see what you think.)

So why stop now? Omnivark was something different and fun to do during the past months while I was also doing consulting. However, that period was an in-between time, from when I completed the Intelligent Cross-Sell integration at RichRelevance until my family’s move back to the Bay Area, which is happening in August. Soon after, I’ll be resuming my normal career—more on that in a future post.

Suffice to say, Omnivark was of a certain place and time, which are changing. I enjoyed creating it; I hope you enjoyed reading it. If you did, here are some suggestions for alternatives to keep your tank topped with great reads.

And finally, for those interested in how the technology worked:

Why Omnivark? summarizes my motivations and the technology.
Following the Elites discusses the challenge of who/what to follow to find great writing.
Stylistic Signals explains how Omnivark determines the great in great writing.
Omnivark’s Division of Labor explores the roles of human and computer during the Omnivark project.

Omnivark’s Division of Labor

2012-07-26T06:25:00.001-07:00

As I taught a computer to recognize great writing, the division of labor between me (the teacher) and the computer changed over time. The Omnivark software started relatively stupid and ended relatively smart.

At the beginning, I was heavily influencing Omnivark’s daily picks of great writing on the Web. This was necessary to create a training set of great writing for Omnivark’s algorithms, so they could learn to find other great writing. Over time, and with a lot of experimentation, Omnivark became smart enough to do most of the work in choosing great reads for an edition.

But to be clear, there was a big distance between Omnivark’s doing most versus all of the work. Most of the work was Omnivark’s finding and scoring hundreds—sometimes over a thousand—of candidate pieces a day. I still needed to pick the best of Omnivark’s best, factoring in issues like diversity of topics and sources.

Unless I was working on the algorithms, I would need to read only perhaps ten of the top-scoring candidates. From them, I often was able to find two of the three Web picks for an edition. I’d find the third pick by scanning further down the list of candidates for an interesting headline, or I would find it from my own normal Web browsing, or I’d occasionally get a great suggestion from an Omnivark reader.

The Omnivark algorithms were capable of rating not only an entire piece but also individual sentences. Once in a while (maybe one in ten times), I’d agree with Omnivark’s choice for the best sentence to use as a quote from the piece. The low agreement was due to Omnivark’s simply judging a sentence for artfulness, whereas I was also judging how well a sentence indicated what the piece was about. Also, I could easily see when multiple sentences, or fragments of sentences, were better than the best sentence—a far from easy task for software.

The fourth and final pick in every Omnivark edition was from a book. This turned out to be a distinct challenge because book excerpts are often in PDFs or special viewer applications such as Amazon’s “Look Inside.” Automating their extraction was different enough from everything else I was doing that I ended up picking the books manually, guided by high user reviews and official endorsements.

So, except for the book picks, the division of labor shifted nicely from human to computer. In terms of replacing a human’s hours, Omnivark got maybe 90% of the way there. However, that last 10%’s hours are a lot harder to automate than the first 90%’s. They involve creative judgment such as knowing when different picks go well together, or recognizing that a certain sentence captures the essence of a larger point. Perhaps someday a computer will do that too, but there will always be the need to model specific humans’ judgments; otherwise, it would be like having the same editor for every magazine. In that sense, humans will always be the teachers.

Stylistic Signals

2012-06-21T12:43:00.000-07:00

As Omnivark trawls the Web for new, great writing, it has two distinct tasks. First, where does it find the candidates (the articles, essays, and blog posts) that might be great writing? My previous post, Following the Elites, was about this challenge.

Second, once Omnivark has a set of candidates, how does it know which few are great? For example, given an entire issue of The New Yorker, what is the best thing in it?

The New Yorker’s editor might say it’s all great. And different readers will surely have different opinions of what’s best. So to clarify: In this case best means most like the structure and style of other great reads. (The other great reads were classified as such by a human expert.)

Note that we are comparing texts’ forms, not their topics. So, given a great read about a boar-hunting congressman, Omnivark will try to find more pieces that are written like that, as opposed to more pieces about boar-hunting congressmen.

This is an important distinction. Most text-analytics systems do topic-matching (find more boar-hunting congressmen). Omnivark is about style-matching. Omnivark will measure a new piece of writing against the characteristics of great writing that Omnivark has already modeled. Those characteristics include statistical, semantic, and structural properties of the text. Some examples:

Simple statistical properties include the text’s total number of words, the average number of words per sentence, and the average number of sentences per paragraph. These simple metrics are better for filtering out the bad than discerning the best among the good. However, more complex metrics (such as the ratio of nouns to adjectives) resonate with certain writing styles.
Semantic properties refer to the meanings of the words used. This is tricky because we want to capture how word choices correlate with style but not with topic. We don’t care that boar appears a lot in the boar-hunting piece; we do care about the artful usage of certain adjectives, adverbs, and other flavoring words, the use of which makes the prose more expressive.
Structural properties include how sentences and paragraphs are put together. For example, the use of balanced or parallel phrases is an indicator of expressive writing, as is the use of similes and metaphors. Detecting these structures in a general way is hard.
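
To make the first category concrete, here is a minimal sketch of the three simple statistical properties. It is an illustration only, not Omnivark's actual code: the function name `simple_stats`, the naive sentence splitting on end punctuation, and the blank-line paragraph rule are all assumptions.

```python
import re


def simple_stats(text: str) -> dict:
    """Compute total words, words per sentence, and sentences per paragraph.

    Hypothetical helper for illustration; the splitting rules are naive.
    """
    # Paragraphs: blocks of text separated by one or more blank lines.
    paragraphs = [p for p in re.split(r"\n\s*\n", text) if p.strip()]

    # Sentences: split after ., !, or ? when followed by whitespace.
    sentences = []
    for paragraph in paragraphs:
        parts = re.split(r"(?<=[.!?])\s+", paragraph.strip())
        sentences.extend(s for s in parts if s)

    # Words: runs of letters or digits, allowing an internal apostrophe.
    words = re.findall(r"[A-Za-z0-9]+(?:['’][A-Za-z0-9]+)*", text)

    return {
        "total_words": len(words),
        "words_per_sentence": len(words) / len(sentences) if sentences else 0.0,
        "sentences_per_paragraph": (
            len(sentences) / len(paragraphs) if paragraphs else 0.0
        ),
    }
```

Numbers like these are cheap to compute, which fits their role as a first-pass filter. The semantic and structural properties would need heavier tools, such as a part-of-speech tagger, and are not attempted here.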

In the world of search engines like Google, these properties are called signals. Omnivark’s job is to know the signals that best predict great writing. As an extra twist, because great writing takes different forms, Omnivark needs to employ different configurations of signals.

Behind the scenes, I built a tool that makes exploring for signals relatively easy. A new signal can be tested in real time on a set of training texts diverse in style and quality.

For me, this exploration for stylistic signals is the most interesting part of creating Omnivark. Having taught writing, I have reasonably good instincts for prose quality. However, knowing it when you see it is different from generalizing that knowledge into a computer. In practice, it’s easy to identify signals that find great writing but also find a lot of mediocre writing. It is much harder to find the signals that cleanly discern the best from the rest.

Following the Elites

2012-06-13T10:26:00.001-07:00

In a perfect world, Omnivark’s software would read everything published on the Web each day, then pick the best three “great reads.” That perfect world is not available. But can we find a more practical path to the same results?

With Omnivark, I’ve explored several approaches. In this post, I will focus on the most obvious and, it turns out, cost-effective: embrace elitism. By that I mean track the top publications where the top writers appear. You can argue whether the list of publications should be 20 or 200 long, but either way it’s nothing compared to the millions of other entities—minor publications, blogs, Tumblrs, Quora postings, and such—that comprise “everything.”

The Atlantic Wire’s “Five Best Columns” daily newsletter exemplifies this approach. It appears to draw from a short list of usual suspects: The New York Times, The Washington Post, and a handful of other top newspapers and highbrow magazines/Websites. The results are quite good.

With Omnivark, I use a much wider array of inputs, and the algorithms ignore a piece’s source. (In a similar vein, by intentionally omitting the source publication’s name from the preview quotes, the Omnivark site encourages readers to judge the preview quotes by their quality, not by where they come from.)

Still, Omnivark ends up with a lot of material from that same group of usual suspects. The reason is, true to reputation, they are venues where superb writing appears in volume. This combination of quality and quantity is hard to beat.

As support, consider Longreads, a crowdsourced site that highlights new, long-form nonfiction. Anybody can nominate a piece from anywhere, usually via the Twitter hashtag #longreads. But despite the potentially wide spectrum of nominations, the site’s official picks are still mostly from elite publications.

I doubt the Longreads editors are suppressing non-elite stuff; if anything, I suspect they welcome the chance to boost something obscure yet worthy. But I also suspect most of the (non-spammy) nominations are for pieces in elite publications because of the quantity/quality reason above.

Plus, when nominations are an open process, another factor helps the more popular, elite publications like The New York Times or The New Yorker. They have thousands of times more readers (and Twitter followers) than smaller publications or independent bloggers. So if the same quality of piece appears in the typical blog and The New Yorker, the New Yorker piece will have thousands of times more potential nominators.

All this goes to say that curating just from the elite publications is a good bang-for-buck strategy. It exploits the concentration of high-quality material in relatively few places.

And if you want to take it a step further but keep the bang-for-buck efficiency, you can also track the elite writers directly, such as by following them on Twitter. That way, you can catch their work outside the elite publications without needing to trawl for it generally. Byliner.com seems to take this approach, as well as commissioning its own pieces.

In theory, an additional benefit of following elite writers is that they can recommend good stuff by other writers. In practice, it works a little, but writers in elite publications often just recommend other stuff in elite publications. Perhaps an apt analogy is with Major League Baseball players, who can talk all day about other MLB players but don’t think as much about what’s happening in the minor leagues.

Of course, this just makes me want to focus more on writing’s equivalent of the minor leagues—the non-elite venues where good stuff lurks deeper and more dispersed. However, if the goal is to surface great writing, today’s lesson is that much of it is already near the surface, in the elite publications where it’s expected to be. Distilling the best of that best is valuable, as the Atlantic Wire’s newsletter and Longreads show. The open question is, how much extra value is there in plumbing the depths further?

McMeasure It

2012-05-29T09:50:00.001-07:00

Well into Manohla Dargis’ New York Times dispatch from the 2012 Cannes Film Festival is a word worth savoring, McMeasured:

The festival’s prejudice toward — or, more generously, its loyalty to — favorite auteurs has been routinely held against its programmers, as if filmmakers and their works should only be McMeasured by the millions and billions served.

It’s quality versus quantity in a single, artful word.

Why Omnivark?

2012-05-07T07:37:00.000-07:00

When I introduced Omnivark a few months ago, many people asked, politely: “Why?” (Quick recap of what it is: The Omnivark Web site helps users discover great writing. Each daily edition highlights three new, nonfiction pieces on the Web, plus a recommended book.)

Omnivark is not about me clicking around the Web all day looking for great writing; it’s about teaching a computer to do that. I did not previously mention the computer’s role because I wanted people to evaluate Omnivark for its content, not its process.

Behind the scenes, the process includes software programs that sift thousands of new Web pages per day, looking for a rare gem. The problem is, physical gems have standardized measures of clarity, cut, and size. The written word lacks equivalent measures, especially to discern great writing from good writing. (Quantifying bad from good is more tractable.)

Lack of measures does not mean lack of agreement about greatness—for better or worse, there are widely acclaimed publications, writers, and pieces. The problem with measuring greatness is the diversity of ways writing can be great. Hemingway’s terseness and Faulkner’s complexity are opposites, yet they are both literary legends from the same era. A gentle eulogy, a political rant, an ironic cultural commentary—should they be judged with the same scorecard? And if great writing transcends mere communication to accomplish something higher, isn’t that beyond the realm of a scorecard?

For me, cutting into this thicket of questions is fun. However, it’s the type of fun suited to a personal project, where walking the path can be the reward. I say that because it’s unclear how far, or where, the path can go. Emulating a human editor’s expert judgement of great writing—based on its content, not on source or popularity or social filtering—is technically hard, if not conceptually quixotic. But that’s what makes it fun. And that, in turn, is the answer to “Why?”

The Fastest Human in History

2012-04-10T07:15:00.001-07:00

A small voice said, “Don’t let me fall, daddy.”

She was on the bike, wobbly, her confidence gone with the training wheels. I was holding her, gently pushing her forward.

“I’m falling!”

“I’m still holding you.”

“Hold me tighter or I’ll fall!”

■

I don’t remember learning to ride a bike. I only remember the moment of transition, when I realized I was doing it. The memory has no visual component, but I imagine my father trailing off behind as I self-propelled forward.

■

On October 14, 1947, a B-29 bomber dropped test pilot Chuck Yeager from 20,000 feet. Yeager was in the Bell X-1, a rocket with wings. Clear of the B-29, Yeager lit the engines.

The X-1 shot upward an additional 20,000 feet, accelerating to 0.92 Mach, 92% of the speed of sound. Then the shaking started.

Other pilots had hit this resistance, which they called the sound barrier. It got worse as you got closer to the speed of sound—how much worse at the extreme, no one knew.

The shaking intensified as the Machmeter read 0.93, 0.94, 0.95, 0.96. The X-1 engineers built the plane for this, but even they didn’t know exactly what this would be. The only way to find out was to go there.

Yeager did, as the X-1 blew through its own shock waves, past the speed of sound. A sonic boom echoed across the desert. Inside, Yeager recalled, it became so smooth that “Grandma could be sitting up there sipping lemonade.” At that moment, he was the fastest human in history.

The B-29 that launched the X-1 trailed off, mission accomplished.

■

We had already done the preliminaries: scooting the bike with her feet, coasting a bit from a small push, and pedaling as I jogged along holding her. All fine. But we were stuck at my letting go while she kept pedaling.

“I don’t want to fall!”

I convinced her it was okay for me to let go a few seconds at a time as she pedaled. Yet when I tried to stretch the counts, she would put her feet down, her shoes skidding the bike to a stop.

She knew she needed to keep pedaling, that more speed meant more balance. But knowing and doing were different things.

■

Amid growing frustration, a friend of hers happened by. A recent success story on two wheels, the friend had a simple statement: If you want to do it, you can. With that, the friend rode off matter-of-factly.

It was the right message, from the right messenger, at the right time. As she watched the friend ride away, I could see my daughter reframing the problem in her mind. It was no longer about wanting to learn, like at school; it was about wanting to graduate.

In our next pass down the street, she pedaled faster. She trusted me to let go as long as she was staying up, allowing my catches to steady her as she continued pedaling. She was beginning to instinctively adjust the front wheel for balance.

Then I was hands-off for five, ten, fifteen strides. “You’re doing it! Keep going!”

She did, accelerating.

I kept running with her, a few steps back. In the retelling, I imagine myself trailing off as she self-propels. At that moment, in our little world, she is the fastest human in history.

New Name, Look, and Features

2012-04-01T12:24:00.002-07:00

After 5+ years and nearly 300 postings, this blog is getting a new name, look, and features.

The name, “Words & Numbers,” is what I would have called it from the beginning, had I known what this blog would be about. But I discovered its aboutness along the way.

The new look and features come with a change of blog platform, from TypePad to Google’s Blogger. Most of the features are minor improvements, such as better support of mobile devices and social sharing. However, I also took the opportunity to redo the topic labels, improve the typography, and add a Best Of section.

Finally, for people who follow via RSS: Sorry for the old items in your RSS reader. The platform change caused that. If you mark everything read, all will be back to normal going forward.

Intelligent Cross-Sell: The CNET Years

2012-03-29T08:04:00.000-07:00

After integrating ExactChoice into CNET.com, my main task was to create something new for CNET. That became Intelligent Cross-Sell, a product used by four of the top ten brands in the Internet Retailer 500, among others.

I was part of CNET Channel, since renamed CNET Content Solutions. Its customers are e-commerce sites that sell technology and consumer-electronics products. Its primary product is a detailed database of products. E-commerce sites use this database to display products and specs in a standardized way. For example, if you see a product page for a computer on CDW.com, much of the page’s content is actually from CNET.

Circa 2005, having attracted a large number of e-commerce customers worldwide, CNET was looking for something new to sell them. My job was to determine what it should be and then to build it with my own team.

The industry term for this role is intrapreneur. It can mean anything from “leader of a CEO’s pet-project skunk works” to “random guy building something not elsewhere classifiable on the org chart.” In my case, I was fortunate to have both a specific place in the org chart and a high degree of autonomy. I also had strong executive support.

By choice, I worked as part of a two-person team, with my ExactChoice partner Howard Burrows. We knew how to explore concepts quickly and cost-efficiently, having practiced what today would be called lean-startup techniques since founding ExactChoice in 2002.

At the outset, I talked with dozens of CNET customers about their e-commerce businesses, looking for the pain points we could reasonably address, ranking them by risk and reward. The opportunity that kept winning was a tool to automate cross-selling. Although everyone was familiar with Amazon.com’s “people who bought this also bought that,” tech and consumer-electronics sites could not use it to determine, for example, the right carrying case with a computer.

Among the challenges with “people who bought this also bought that” were:

If a few consumers mistakenly bought the wrong-sized case for a computer, the algorithm would start recommending the bad combo, causing a slew of returned products.
It was useless for new products without sales history—no people who bought this, then no people who bought that.
It left no room for merchandising. For example, as computers began appearing with the Bluetooth wireless standard, cross-selling Bluetooth mice made sense. But how could merchants tell the algorithm to do that when it was only looking backward at the non-Bluetooth past?

Because of these issues, many large tech and consumer-electronics sites were using humans to manually configure cross-sells. These sites had tens or hundreds of thousands of products, changing rapidly. The humans could not keep up. We would later attract two of our early customers—billion-dollar e-commerce sites—by showing them their percentages of empty cross-selling slots.

The beauty of the opportunity was that it played to CNET’s strength. The CNET product database, DataSource, had the size of most computers. It also had the size capacities for most carrying cases. A trivial math operation could prevent a sizing mismatch. This is what the humans were doing in their heads, one product combination at a time. This is what we could do nearly instantly, across an entire product catalog.

In addition to preventing bad cross-sells, we could also enable good ones: Bluetooth mouse to Bluetooth computer? No problem. Match the mouse’s brand with the computer’s brand? Easy. CNET’s database had more than 100 million product attributes to fuel such rules, which would emulate how a person intelligently chooses cross-sells.
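
The attribute-driven rules described above can be sketched in a few lines. This is a hypothetical illustration, not the Intelligent Cross-Sell implementation or the DataSource schema: the class names, field names, and the `recommend` function are assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Computer:
    name: str
    brand: str
    screen_inches: float
    bluetooth: bool


@dataclass(frozen=True)
class Case:
    name: str
    brand: str
    max_screen_inches: float


@dataclass(frozen=True)
class Mouse:
    name: str
    brand: str
    bluetooth: bool


def case_fits(computer: Computer, case: Case) -> bool:
    # The sizing check: the case must hold a screen at least this large.
    return computer.screen_inches <= case.max_screen_inches


def mouse_compatible(computer: Computer, mouse: Mouse) -> bool:
    # A Bluetooth mouse is only offered with a Bluetooth computer.
    return computer.bluetooth or not mouse.bluetooth


def recommend(computer: Computer, accessories: list) -> list:
    """Return accessories that pass the rules, same-brand items first."""
    eligible = []
    for item in accessories:
        if isinstance(item, Case) and case_fits(computer, item):
            eligible.append(item)
        elif isinstance(item, Mouse) and mouse_compatible(computer, item):
            eligible.append(item)
    # Stable sort: brand matches move to the front, other order is kept.
    return sorted(eligible, key=lambda item: item.brand != computer.brand)
```

Because the rules read only product attributes, they work on a brand-new product with no sales history, which is the gap the purchase-history approach could not fill.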

Of course, the system would measure itself, so we would have additional data about each product’s sales, its effectiveness as a cross-sell, even its behavioral performance in “people who did this also did that.” I liked that, because attribute-driven rules and behavioral data were together likely better than either approach separately.

Finally, the system would need to support hands-on use by merchandisers. Rules would be customizable, in a drag-and-drop way. And reports would link back to rules, so a merchandiser could see which rules caused which numbers.

That was the vision for Intelligent Cross-Sell. We announced the product in February 2006 and released it later that year, with paying customers.

By the first release, we could already see Intelligent Cross-Sell was substantially increasing customers’ cross-selling revenue. We later did case studies with Office Depot and Dell that reported a doubling of cross-sell and upsell revenue. (Upsells are another type of product recommendation that Intelligent Cross-Sell does. Whereas a cross-sell offers a carrying case with a computer, an upsell offers a better computer in place of the one you are considering. When doing this, Intelligent Cross-Sell can automatically generate “pitch text” based on an analysis of each computer’s specs, such as “Faster processor and 50% more storage.”)

Although we hit the market as the housing-bubble-induced recession was starting, we managed to get a decent core of customers in the 2007 to 2009 timeframe. By 2010, among our customers were four of the top ten brands in the Internet Retailer 500’s list of e-commerce sites. We had also gone international, at sites in the United Kingdom, France, Germany, and Denmark. Later, we reached sites in Sweden, Norway, and the Baltics.

As we grew Intelligent Cross-Sell’s revenue, we hired a small team to help evolve and support the product. Things were good in our little world.

But 2010 was a turning point. In the previous few years, several venture-funded startups had emerged as competitors, each with vastly more resources than our small group. They had all started with “people who did this also did that” technology, applying it not just to tech and consumer electronics but to all e-commerce categories. Although Intelligent Cross-Sell was still superior for cross-selling tech and consumer-electronics products, the best start-ups were using their greater resources to offer a broader set of capabilities, with cross-selling and upselling being just one aspect.

We knew the game was changing when, in mid-2010, two customers who had been highly satisfied nevertheless defected to other vendors. The other vendors simply offered more stuff. It was like being a bakery in a town that starts getting supermarkets. Our bread was better, but we didn’t have a deli counter or a produce aisle.

We could have adapted by becoming even more specialized, like an artisanal bakery of cross-selling. But it would have been hard to do within CNET, which had become CBS Interactive when the media giant CBS acquired CNET in 2008. As an enterprise software-as-service player, we were already an outlier of a business within CNET, more so for CBS. I did not want to make us even more marginal. So I concluded that everybody—CBS/CNET, the Intelligent Cross-Sell team, our customers—would be better off if we could partner with one of the other players whose only business was doing what we did.

The right match turned out to be RichRelevance, the personalization company with, by far, the most blue-chip customer base, as well as the most complementary approach to the market. In the partnership, RichRelevance would run the Intelligent Cross-Sell technology and employ the team; CNET would license its data and provide sales collaboration. The best supermarket would now offer artisanal bread.

The deal proved to be a win for everyone. For me, having spent five years on the product, having built a team without ever losing an employee, and having worked directly with every customer, I wanted Intelligent Cross-Sell to continue on the best footing possible. It did, and it still does.

Meet Omnivark

2012-01-24T01:53:00.000-08:00

Along with taking some time off and doing consulting, I’ve been working on a new project:

Omnivark is a highlight reel for the written word. Each weekday we short-list the best new writing on the Web—the kind of writing that delivers such surprise and delight that you feel bad for not having time to find or read it. ;)

Omnivark creates that time for you. It fits the best stuff into an idle moment on your mobile phone or tablet or computer.

From writers famous to obscure, on topics familiar to foreign, Omnivark curates the well-said for the well-read.

I’ll be explaining more about the motivations and technology behind Omnivark soon. In the meantime, please check it out if you’re so inclined. (And if I may tilt your inclination, see last Friday’s edition, first entry, which you are statistically likely to appreciate.)

You can get Omnivark each weekday free via email, Twitter, Facebook, or RSS.

Just Follow the Signs...

2011-12-01T13:34:00.000-08:00

[From the corner of Market and Morgan in Hartford, CT]

Nuclear Weapons and Murphy’s Law

2011-11-10T12:23:00.000-08:00

Murphy’s Law says, anything that can go wrong will go wrong. In 1958, as the Cold War’s nuclear-arms race was accelerating, researchers at the think tank RAND worried that something—the ultimate thing—could go wrong with a nuclear weapon.

By that time, at least a dozen nuclear-weapon mishaps had occurred, including accidental drops, jettisons, and crashes. Due to technical and human safeguards, the nuclear material did not detonate. But the researchers saw ways the safeguards could fail or be intentionally defeated. Thus the question: Could Murphy’s Law go nuclear?

The researchers’ report, “On the Risk of an Accidental or Unauthorized Nuclear Detonation,” was declassified in 2000 and is now on the Internet. It is an interesting example of how to think about the risk of something happening when it has not happened before.

Normally, risks are associated with odds, and odds are based on past observations. For example, during the 1950s, the U.S. Air Force’s B-52 bomber had a number of accidents. Dividing that number by the total B-52 flight-hours gave odds of one accident per 25,000 flight-hours.

Lacking a record of nuclear-detonation accidents, the researchers could not calculate odds in the same way. Zero divided by anything would be zero.
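
The arithmetic behind both paragraphs can be sketched as follows. The accident and flight-hour counts are illustrative assumptions, chosen only so the result matches the one-per-25,000 figure; the actual underlying counts are not given here, and the function names are hypothetical.

```python
import math


def events_per_hour(events: int, flight_hours: float) -> float:
    """Observed frequency: events divided by exposure."""
    if flight_hours <= 0:
        raise ValueError("flight_hours must be positive")
    return events / flight_hours


def hours_per_event(events: int, flight_hours: float) -> float:
    """Average exposure per event; infinite when no event was observed."""
    if flight_hours <= 0:
        raise ValueError("flight_hours must be positive")
    if events == 0:
        return math.inf
    return flight_hours / events


# Illustrative counts only: 4 accidents over 100,000 flight-hours.
bomber_odds = hours_per_event(4, 100_000)

# With no recorded detonations, the same method says the risk is zero,
# no matter how much exposure there has been.
detonation_rate = events_per_hour(0, 100_000)
```

The second result is the problem: a frequency estimate cannot tell "has not happened yet" apart from "cannot happen."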

The RAND researchers argued the actual risk was not zero. They cited numerous plausible scenarios in which technical flaws, human errors, sabotage, or some combination of these factors could cause a nuclear detonation. The bad scenarios were all highly unlikely, but no one knew how unlikely. In contrast, it was certain that the likelihood of an accident was increasing with the number of nuclear weapons.

The researchers also saw increasing risk in a key trend of the time, having more planes on continuous ground alert, or staying continuously aloft, with nuclear weapons ready to strike. This trend would greatly increase the number of flight-hours in which an accident could occur, as well as opportunities for various other human mistakes.

Finally, the researchers delved deeply into the possibility that an insider could deliberately override safeguards in an act of nuclear sabotage. Precedents existed for non-nuclear saboteurs, including military personnel with mental disorders. Against this backdrop, the researchers noted that many then-current nuclear weapons could be detonated singlehandedly by an individual with the right access and knowledge.

In response to these scenarios, the researchers recommended new efforts to develop technical and process safeguards to further reduce risk without sacrificing readiness. For example, the researchers suggested a lock for nuclear weapons, the combination for which would only be transmitted with the order to use the weapon.

The researchers also praised the idea of an acceleration switch, then under development, that would prevent a weapon from detonating while being handled on the ground. To illustrate the value, the researchers cited training incidents that would have caused a nuclear detonation if they occurred in the field.

Unlike many research reports, this one influenced the highest levels of decision-making. As told in Sharon Bertsch McGrayne’s The Theory That Would Not Die, the Commander of the U.S. Air Force’s Strategic Air Command, General Curtis LeMay, ordered new safeguards for nuclear weapons because of the report.

(McGrayne’s book is a popular account of the historical uses of Bayesian probability, a technique that incorporates degrees of subjective belief in addition to direct observations. The Bayesian approach can be useful when there aren’t enough observations to analyze or when the observations have uncertainties. Some of the statistical analyses in the RAND report used a Bayesian approach, which was unusual for the time.)
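To make the zero-observations problem concrete, here is a minimal Python sketch. It is not the RAND researchers’ method; it uses Laplace’s rule of succession, a textbook Bayesian estimate that assumes a uniform prior. The function names and the trial count are hypothetical, chosen only for illustration.

```python
from fractions import Fraction


def observed_frequency(events: int, trials: int) -> Fraction:
    """Odds based purely on past observations: events divided by trials."""
    return Fraction(events, trials)


def rule_of_succession(events: int, trials: int) -> Fraction:
    """Posterior mean under a uniform prior: (events + 1) / (trials + 2)."""
    return Fraction(events + 1, trials + 2)


# Hypothetical exposure count for illustration; not a figure from the report.
trials = 10_000
print(observed_frequency(0, trials))   # 0
print(rule_of_succession(0, trials))   # 1/10002
```

With no accidents on record, the observed frequency is exactly zero, while the Bayesian estimate stays small but positive. How small depends on the prior, which is where subjective belief enters.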

Since the early 1960s, the United States has continued to improve nuclear-weapons safeguards, not just due to research reports but also due to close calls. For example, in 1961 an air accident plunged two hydrogen bombs into a North Carolina field. One of the recovered bombs only had a single safeguard—out of six—remaining to prevent a nuclear detonation. Other accidents also avoided a detonation but spilled dangerous nuclear material.

Compared to early nuclear weapons, modern nuclear weapons have far stronger safeguards. They include a more sophisticated version of the combination lock suggested in the RAND report, physically requiring two people to unlock; arming components designed to fail under adverse conditions such as a crash, thus making them “fail safe”; and special types of conventional explosives and containment devices to prevent leakage of nuclear materials in an accident.

In addition to having safer weapons, the United States now has far fewer nuclear weapons deployed, on lower levels of alert, than during the height of the Cold War. So, the RAND researchers (Fred Charles Iklé, Gerald J. Aronson, and Albert Madansky) would be pleased.

I am pleased too. Reading their report reminded me of the time I toured a decommissioned Titan II nuclear-missile silo in Arizona. Although it was a relatively low-tech artifact of the 1960s, I was impressed with how well considered its design and operating procedures were. It felt like those involved were up to the enormous responsibility attached to their jobs. That included everyone from the thinkers at RAND to the systems designers to the hands-on crews.

May they all continue their success, in the United States and wherever else Murphy’s Law and nuclear weapons could meet.

Witold Rybczynski’s One Good Turn

2011-10-17T06:41:00.000-07:00

When you see a bucket of screws at the hardware store, you probably don’t think of them as technology. After reading One Good Turn by Witold Rybczynski, you will think different.

Rybczynski argues that the screw was an exceptionally creative solution to the problem of fastening things. To illustrate what we take for granted today, he provides this capsule history of another fastener:

[A] useful device that secures clothing against cold drafts, [the button] was unknown for most of mankind’s history. The ancient Egyptians, Greeks, and Romans wore loose tunics, cloaks, and togas. Buttons were likewise absent in traditional dress throughout the Middle East, Africa, and South Asia. True, the climate in these places is mild, but northern dress was likewise buttonless. Eskimos and Vikings slipped their clothes over their heads and cinched them with belts and straps; Celts wrapped themselves in kilts; the Japanese used sashes to fasten their robes. The Romans did use buttons to ornament clothing, but the buttonhole eluded them. The ancient Chinese invented the toggle and loop, but never went on to the button and buttonhole, which are both simpler to make and more convenient to use. Then, suddenly in the thirteenth century in northern Europe, the button appeared. Or, more precisely, the button and the buttonhole. The invention of this combination—so simple, yet so cunning—is a mystery. There was no scientific or technical breakthrough—buttons can easily be made from wood, horn, or bone; the buttonhole is merely a slit in the fabric. Yet the leap of imagination that this deceptively simple device required is impressive. Try to describe in words the odd flick-and-twist motion as you button and unbutton and you realize just how complicated it is. The other mystery of the button is the manner of its discovery. It is difficult to imagine the button evolving—it either exists or it doesn’t. We don’t know who invented the button and the buttonhole, but he—more likely she—was a genius.

I have quoted at length because the passage is a miniature version of the book. Replace button with screw, and you’ve got Rybczynski’s thesis: Whereas nails came from spikes, which were crude and obvious implements, the screw came from what? Its key feature—and the source of its superior holding power compared to a nail—is the helical thread that winds around the shaft. The helix was neither obvious to conceive nor easy to implement in materials.

Like a genealogist tracing older and older ancestors, Rybczynski searches for evidence of the earliest screws and screwdrivers. He profiles key innovators along the way, such as those who created the precision machine tools necessary for mass-manufactured, standardized screws; or inventors who improved on the flathead screw, namely Phillips’ x-shaped socket and Robertson’s square socket. The patent wars of yesteryear were about such things.

As much as One Good Turn is about screws, screwdrivers, and other tools, it is also about an intellectual quest. Unsatisfied with the literature on the subject, Rybczynski narrates his way through libraries and museums, each holding clues to the further history of the screw. He assembles new evidence of screws as fasteners in the Middle Ages. Then he keeps going in search of the ur-screw, back to ancient Greece.

Like the societies that had the button but not the buttonhole, the Greeks (and later the Romans) had the screw but not for fastening. Rather, the Greeks had large-scale helical screws for mechanical use. It was there and then that Rybczynski believes the original insight of the helical screw occurred, likely by the great engineer Archimedes.

So the next time you think about technology and a computer comes to mind, One Good Turn will remind you that technology has a far longer thread back through history.

Changing Gears

2011-10-06T02:38:00.000-07:00

Having reached a good outcome with CNET Intelligent Cross-Sell’s transition to RichRelevance, I will be taking this opportunity to switch gears: I am going into independent-consultant mode. That will include remaining on the RichRelevance team in a consulting capacity. I will be working with a few other consulting clients as well. I will also be using the flexibility of consulting to reserve some time for myself.

I realize that some people say they’re doing consulting as a euphemism for looking for a job. To be clear about my situation, consulting is currently what I want my job to be. Having been deep into a single thing for five years, with very high-stakes customers, I’m ready to come up for air. And if I’m coming up for air, I might as well breathe deeply. ;)

I’m fortunate to have clients right out of the gate, but I am always happy to hear of interesting opportunities where a little bit of my expertise and abilities can have significant impact. The areas I’m covering are product design, product marketing, strategy, company evaluations for M&A and venture capital, and advising middle- to later-stage startups on internal innovation for “next act” products (those that come after the core product that the entire company has been built around).

Feel free to drop me a line if there’s a connection or discussion to be had.

CNET Intelligent Cross-Sell @ RichRelevance

2011-10-06T02:35:00.000-07:00

Six months ago, CNET Content Solutions announced a strategic partnership with RichRelevance regarding Intelligent Cross-Sell, the product I co-created and the team I led at CNET. This blog post by RichRelevance’s CEO, David Selinger, describes what Intelligent Cross-Sell does and how it adds value to RichRelevance’s product line.

Other than posting a link to the CNET-RichRelevance announcement on Twitter, I didn’t say much about the deal when it was announced. My feeling was, I’ll talk about it when we’ve accomplished something more than announcing the partnership. Now is the time.

Having spent six months working closely with RichRelevance, I am pleased with the result: 100% of the customers are transitioned, the technology is migrated, the ICS team is at RichRelevance, and we’ve already done deals for new customers as part of RichRelevance. Meanwhile, CNET Content Solutions continues to bring product data, industry expertise, and sales support to the partnership. It’s a win/win as both sides now benefit from a bigger Intelligent Cross-Sell business than would have been possible from CNET alone.

In the next post, I’ll say what this means for me going forward.

My Perestroika

2011-09-25T13:15:00.000-07:00

My Perestroika is a documentary film about the lives of five Russians. They were children of the Soviet Union, which fell down as they grew up.

Now middle-aged, some have ridden the waves of change; others have treaded water. The film splices their present-day selves with their home-movie pasts and their uncertain futures.

I thought it was superb, but you can judge the trailer for yourself:

Simon Winchester’s A Crack in the Edge of the World

2011-09-17T06:58:00.000-07:00

Call it coincidence, but as I was writing about em dashes and long sentences lately, I was reading Simon Winchester’s A Crack in the Edge of the World. If you like long, lyrical sentences, festooned with em dashes, read on.

The geology of the northern half of California—whether we are talking about San Francisco Bay or the Central Valley, the Coast Range or the Sierra, the Monterey headlands or the coast of Humboldt County or Mount Diablo itself—is all interlinked, subtly, confusingly and, for the geological mapmakers, often maddeningly. These links go far beyond the borders of the state—political lines that pay no heed, in this case, to the absolutes of geology. They spread far, far beyond—as we shall discover, they reach up to Alaska, they percolate across to Wyoming and Montana, they reach back west across two oceans as far, in fact, as India and Australia. One might say, indeed, that the story of what makes California so complex and so interesting and so dangerous—and what makes Diablo so similarly geologically alluring—has implications for, and connections to, the planet in its entirety.

It’s a marathon of a paragraph, but I like its layered, controlled complexity. Yet taken too far, that style can delineate itself to death:

[The basalts] spilled over and laid themselves down on the old Pangaea-Columbia-Arctica-Ur granitelike continental rocks that exist underneath, making the confection of geology that—in juxtaposition with all the ice and snow of climate and the storms and winds of weather, the polar bears and lichens of biology and the Eskimo and Inuit and Danes and American soldiers of anthropology—constitutes the great and mysterious island known today as Greenland.

The good news is, that specimen is the exception, not the rule. Winchester’s long and winding sentences are usually a pleasure to parse. Like poetry, they require more of you, but they reward the effort.

The same can be said of A Crack in the Edge of the World’s subject matter. The 1906 San Francisco earthquake is the center of a narrative constellation that includes a roadtrip across North American geologic points of interest, a people’s-eye history of California from Gold Rush to early twentieth century, a seminar on geologic science, and ruminations about the fragility of human existence versus nature. Consult the 27-page index for other topics covered.

Put another way, if books were beer, A Crack in the Edge of the World would be an earthy, flavorful stout: gaggingly thick for some people, satisfyingly rich for others. I found myself in the latter camp.

Friends and Strangers

2011-09-11T11:46:00.000-07:00

On the tenth anniversary of 9/11, I’ve been thinking about two stories. First, there was Abe Zelmanowitz, a worker at the World Trade Center. His co-worker and best friend, Ed Beyea, was a quadriplegic. As the building burned and people streamed down the stairwells, Abe stayed with Ed on the 27th floor landing. Because Ed weighed nearly 300 pounds, they were waiting for a rescue team to safely carry him.

Ed had a health aide, Irma, in the building. Although she found the two men, Irma was having trouble breathing from the smoke. Abe told her to go, that he would stay with Ed.

Both men called their families to say they were okay. Abe’s mother pleaded with him to get out while he could. He stayed.

Abe and Ed died in the building that day.

A family member recalled of the two, “If Ed was going to make [dinner] arrangements, he’d make sure it was kosher, and if Abe was going to make the arrangements, he’d make sure it was wheelchair-accessible. They always had each other’s best interests at heart.”

The second story is about Mike Benfante, who was also working in the World Trade Center. During the evacuation, he found wheelchair-bound Tina Hansen. He was a stranger to her, but Mike and colleague John Cerqueira carried Tina down 68 flights of stairs, often in darkness and smoke, sloshing through areas flooded by building sprinklers. It took 90 minutes. The building collapsed five minutes after they got out.

As the tenth anniversary of that day approached, Mike deflected attention from his heroism to the larger lesson of what 9/11 summoned in friends and strangers alike: “I’ve learned that 9/11 showed us that there are enormous, untapped reservoirs of extraordinary human kindness and generosity just waiting for a trigger, that this trigger should be pulled daily as most of us are basically good people.”

The Long and Short of Average Sentence Length

2011-08-07T04:26:00.000-07:00

For an earlier post, I analyzed the text from 616 articles in Slate’s sections “The Good Word” and “Books.” The purpose was to answer a question about the use of em dashes, but since I had all 697,422 words at the ready, I asked another question: Of all the articles, which had the longest and shortest average sentence length?

For context, the average sentence length across all articles was 25.4 words per sentence. The winner for longest average sentence length was nearly twice that. The shortest was about a third less.

And now, the drumroll please....

The winner for longest average sentence length, at 49.7 words per sentence, was Daphne Merkin’s review of Decca: The Letters of Jessica Mitford. Here is the opening paragraph:

Although it is not uncommon for big families to produce a rebel or two along with the chip-off-the-old-block offspring, there are few that can lay claim to as much dissension within the ranks as the aristocratic clan of Mitford. This gaggle of wayward sisters (six in all, with one brother, Tom, who was killed in combat in 1945 at the age of 36) included Diana, the family beauty, who married the dastardly Oswald Mosley, head of the British Fascist party; Nancy, the family wit, whose novel The Pursuit of Love kick-started the proliferation of novels, memoirs, and biographies that would come to be called the Mitford “industry”; and the family madwoman, Unity, who went bonkers for Adolf Hitler and put a pistol to her head when Britain declared war on Germany.

Compare and contrast with the winner for shortest average sentence length, Jason Sokol’s commentary on The Presidential Recordings: Lyndon B. Johnson: Mississippi Burning and the Passage of the Civil Rights Act: June 1, 1964-July 4, 1964. Sokol’s average of 16 words per sentence was less than the length of the title. Here is the first paragraph:

President Lyndon Johnson, domineering and manipulative, lives on in American memory as the classic power broker. He bullied opponents, sweet-talked skeptics, and chewed out subordinates. He oozed confidence as he passed one piece of landmark social legislation after another, even as his cockiness helped to mire the country in Vietnam. Yet this is not the Johnson who emerges from volumes seven and eight of The Presidential Recordings, a transcription of his phone conversations from June 1 to July 4 of 1964.

My purpose is not to claim one of these examples is better than the other. They are both well-crafted paragraphs. But side by side, they are a reminder of how stylistically diverse good writing can be.
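For anyone who wants to try a similar measurement, here is a minimal Python sketch of an average-sentence-length calculation. It assumes a naive rule (a sentence ends at a period, exclamation point, or question mark followed by whitespace) and counts words by splitting on whitespace. This post does not say how sentences were actually split, so the function name and the splitting rule are illustrative assumptions, and results on real articles may differ from the figures above.

```python
import re


def average_sentence_length(text: str) -> float:
    """Mean words per sentence, using a naive split after ., !, or ?."""
    # Assumption: terminal punctuation followed by whitespace ends a sentence.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    if not sentences:
        return 0.0
    word_counts = [len(s.split()) for s in sentences]
    return sum(word_counts) / len(sentences)


sample = "He bullied opponents. He oozed confidence as he passed laws."
print(average_sentence_length(sample))  # 5.0
```

A naive splitter like this miscounts abbreviations such as “B.” in “Lyndon B. Johnson,” so a careful analysis would need a more robust sentence tokenizer.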

Citizen U.S.A.

2011-07-31T10:33:00.000-07:00

Fellow Americans: If the dysfunctional circus of Washington, DC, is getting you down, let me suggest an hour’s worth of relief: an HBO documentary film called Citizen U.S.A.

Filmmaker Alexandra Pelosi visited citizenship-induction ceremonies in all fifty states, interviewing new citizens. Some originally came as refugees, some snuck in and later got amnesty, some were students, and some were standard immigrants. The lucky ones were pursuing happiness. Many, especially women, were pursuing more basic needs like living in safety, speaking freely, or being able to work to support themselves. You will feel good about what America has done for them, as well as what they are doing for America.

The stars include New York City coffee cart guys from Afghanistan, a Buddhist monk in Utah, a nuclear scientist at Los Alamos National Lab, a Nigerian Paralympic athlete in Kentucky, Iraqi refugees transplanted to Nebraska, and a Muslim mother with a dream to take a cruise to Alaska. (Most of these descriptions came from the film’s synopsis at the HBO site. See also the trailer on YouTube.)

Citizen U.S.A. is currently available on Comcast’s On Demand service, but you need to be an HBO subscriber. I assume the film will appear soon on Netflix and other venues. Be on the lookout.

The Use and Abuse of the Em Dash

2011-07-23T08:30:00.000-07:00

In Slate, Noreen Malone makes The Case—Please Hear Me Out—Against the Em Dash. She says it undercuts good writing, yet writers are using it more. To make her point, she oversalts her own prose with the em:

The problem with the dash—as you may have noticed!—is that it discourages truly efficient writing. It also—and this might be its worst sin—disrupts the flow of a sentence. Don’t you find it annoying—and you can tell me if you do, I won’t be hurt—when a writer inserts a thought into the midst of another one that’s not yet complete?

Having thus revealed the em dash’s peril, Malone later concludes, “Leave the damn em dash alone.”

I suggest not. The em dash is a good thing, albeit the kind where too much good is bad. As we say in software development, that is a feature, not a bug.

Of the em dash’s many uses, the main one is to set off a phrase with greater emphasis. Used in tandem—as here—em dashes are like commas or parentheses, only more assertive. Used alone, an em dash heightens what comes next—more drama if no comma.

Em dashes are effective for emphasis because they are rare. Use them too much and you defeat their purpose, as Malone demonstrates with her wanton em dash abuse. But is today’s writing increasingly like that? Malone asserts such a trend but caveats that it’s “just anecdotal observation; I admit I haven’t found a way to crunch the numbers.”

Here’s a way to crunch the numbers: Extract the text of hundreds of articles published in Slate from 1996 to 2011. Focus on the sections “The Good Word” (where Malone’s article is filed) and “Books.” They seem like good candidates for the at-risk writerly behavior that Malone fears.

When I did that, I found 616 articles through the end of June 2011, totaling 697,422 words. Because the number of articles varied widely from year to year, I split the articles into two periods: 1996 to 2004 and 2005 to 2011.

The earlier period had 7.6 em dashes per thousand words; the later period had 7.8. That difference is noise. Malone’s peers are not spiraling into an abyss of increasing em-dashery.

So despite Malone’s concerns, I suspect that Slate’s writers are using the em dash to good effect. They know that with punctuation, as with salt, an occasional dash will do you good.
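The number-crunching described above can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the script behind the figures in this post: it assumes each article is already available as a (year, text) pair, and it treats a word as any run of characters bounded by whitespace or an em dash. The names `count_words` and `rate_by_period` are hypothetical.

```python
import re

EM_DASH = "\u2014"


def count_words(text: str) -> int:
    """Count words, treating whitespace and em dashes as separators (an assumption)."""
    return len(re.findall(rf"[^\s{EM_DASH}]+", text))


def rate_by_period(articles, split_year=2005):
    """Em dashes per thousand words, before split_year and from split_year on.

    articles: iterable of (year, text) pairs.
    """
    totals = {"early": [0, 0], "late": [0, 0]}  # [dashes, words]
    for year, text in articles:
        key = "early" if year < split_year else "late"
        totals[key][0] += text.count(EM_DASH)
        totals[key][1] += count_words(text)
    return {k: (d * 1000 / w if w else 0.0) for k, (d, w) in totals.items()}


# Tiny made-up inputs, only to show the shape of the calculation.
articles = [(1999, "a\u2014b c d"), (2010, "e f g h")]
print(rate_by_period(articles))  # {'early': 250.0, 'late': 0.0}
```

Pooling dashes and words within each period, rather than averaging per-article rates, keeps short articles from swaying the result.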

David Eagleman’s Sum

2011-06-26T16:18:00.000-07:00

“In one afterlife, you may find that God is the size of a microbe and unaware of your existence. In another version, you work as a background character in other people’s dreams. Or you may find that God is a married couple, or that the universe is running backward, or that you are forced to live out your afterlife with annoying versions of who you could have been.”

That is from the back cover of David Eagleman’s Sum, subtitled “forty tales from the afterlives.” Each tale is a vignette about what happens when you die.

Eagleman is a neuroscientist, and Sum is a literary mind game. With imaginative what-ifs, he subverts familiar conceptions of life and death. Instead of a singular light in the dark, you get a light show.

It’s quirky, adventurous, and at times eloquent. It’s a virtuoso performance of thinking different. It’s also admirably brief.

If Sum sounds interesting, The New York Times has an excerpt with four vignettes. And here is the book’s page at Amazon.