[FoRK] The Man Who Wrote 200,000 Books

Jeff Bone <jbone at place.org> on Mon Apr 14 07:50:45 PDT 2008

		
Speaking of bots (or perhaps in this case, "cyborgs" would be more  
appropriate)...

   http://www.nytimes.com/2008/04/14/business/media/14link.html?_r=2&dpc&oref=slogin&oref=slogin

--

April 14, 2008

LINK BY LINK

He Wrote 200,000 Books (but Computers Did Some of the Work)

By NOAM COHEN
It’s not easy to write a book. First you have to pick a title. And  
then there is the table of contents. If you want the book to be  
categorized, either by a bookseller or a library, it has to be  
assigned a unique numerical code, like an ISBN, for International  
Standard Book Number. There have to be proper margins. Finally,  
there’s the back cover.

Oh, and there is all that stuff in the middle, too. The writing.

Philip M. Parker seems to have licked that problem. Mr. Parker has  
generated more than 200,000 books, as an advanced search onAmazon.com  
under his publishing company shows, making him, in his own words, “the  
most published author in the history of the planet.” And he makes  
money doing it.

Among the books published under his name are “The Official Patient’s  
Sourcebook on Acne Rosacea” ($24.95 and 168 pages long); “Stickler  
Syndrome: A Bibliography and Dictionary for Physicians, Patients and  
Genome Researchers” ($28.95 for 126 pages); and “The 2007-2012 Outlook  
for Tufted Washable Scatter Rugs, Bathmats and Sets That Measure 6- 
Feet by 9-Feet or Smaller in India” ($495 for 144 pages).

But these are not conventional books, and it is perhaps more accurate  
to call Mr. Parker a compiler than an author. Mr. Parker, who is also  
the chaired professor of management science at Insead (a business  
school with campuses in Fontainebleau, France, and Singapore), has  
developed computer algorithms that collect publicly available  
information on a subject — broad or obscure — and, aided by his 60 to  
70 computers and six or seven programmers, he turns the results into  
books in a range of genres, many of them in the range of 150 pages and  
printed only when a customer buys one.

If this sounds like cheating to the layman’s ear, it does not to Mr.  
Parker, who holds some provocative — and apparently profitable — ideas  
on what constitutes a book. While the most popular of his books may  
sell hundreds of copies, he said, many have sales in the dozens, often  
to medical libraries collecting nearly everything he produces. He has  
extended his technique to crossword puzzles, rudimentary poetry and  
even to scripts for animated game shows.

And he is laying the groundwork for romance novels generated by new  
algorithms. “I’ve already set it up,” he said. “There are only so many  
body parts.”

Perusing a work like the outlook for bathmat sales in India, a reader  
would be hard pressed to find an actual sentence that was “written” by  
the computer. If you were to open a book, you would find a title page,  
a detailed table of contents, and many, many pages of graphics with  
introductory boilerplate that is adjusted for the content and genre.

While nothing announces that Mr. Parker’s books are computer  
generated, one reader, David Pascoe, seemed close to figuring it out  
himself, based on his comments to Amazon in 2004. Reviewing a guide to  
rosacea, a skin disorder, Mr. Pascoe, who is from Perth, Australia,  
complained: “The book is more of a template for ‘generic health  
researching’ than anything specific to rosacea. The information is of  
such a generic level that a sourcebook on the next medical topic is  
just a search and replace away.”

When told via e-mail that his suspicion was correct, Mr. Pascoe wrote  
back, “I guess it makes sense now as to why the book was so awful and  
frustrating.”Mr. Parker was willing to concede much of what Mr. Pascoe  
argued. “If you are good at the Internet, this book is useless,” he  
said, adding that Mr. Pascoe simply should not have bought it. But,  
Mr. Parker said, there are people who aren’t Internet savvy who have  
found these guides useful.

It is the idea of automating difficult or boring work that led Mr.  
Parker to become involved. Comparing himself to a distant disciple  
ofHenry Ford, he said he was “deconstructing the process of getting  
books into people’s hands; every single step we could think of, we  
automated.”

He added: “My goal isn’t to have the computer write sentences, but to  
do the repetitive tasks that are too costly to do otherwise.”

In an interview from his home in San Diego and his offices nearby, Mr.  
Parker described his motivation as providing content that the  
marketplace has otherwise neglected for lack of an audience. That can  
mean a relatively obscure language is involved, or a relatively  
obscure disease or a relatively obscure product.

Take, for example, the study of bathmats in India.

“Only one person in the world may be interested in that,” he conceded,  
“probably a strategic planner for a multinational that makes those.”  
But he points out that once he has trained the computer to take data  
about past sales and make complex calculations to project future  
sales, each new book costs him about 12 cents in electricity. Since  
these books are print-on-demand or delivered electronically, he is  
ahead after the first sale, he said.

His company, the Icon Group International, is the long tail of the  
bell curve come to life — generating significant total sales by adding  
up tens of thousands of what might be called worst sellers. For  
example, a search at the Galter Health Sciences Library of the  
Feinberg School of Medicine at Northwestern University found half a  
dozen Icon books, mainly in the library for patients and their families.

Icon is “a very innovative and interesting example of print on  
demand,” said Kurt Beidler, a senior manager at Amazon.com who runs  
the publishers’ services for BookSurge, Amazon’s print-on-demand  
company. “A lot of examples of print on demand take older books and  
bring them back — really acting as a supply-chain tool. In this kind  
of business, it’s a new business, using this capability to introduce  
new material to customers.”

Mr. Parker compares his methods to those of a traditional publisher,  
but with the computer simply performing some of the scut work. In an  
explanatory YouTube video, Mr. Parker shows a book being created. The  
computer is given an assignment — project the latent demand for  
antipsychotic drugs around the world, based on the sales figures in  
the United States.

“Using a little bit of artificial intelligence, a computer program has  
been created that mimics the thought process of someone who would be  
responsible for doing such a study,” Mr. Parker says. “But rather than  
taking many months to do the study. the computer accomplishes this in  
about 13 minutes.”

An editor picks the years to be covered, but the computer picks the  
optimum model for extrapolating sales in various countries, and in  
alphabetical order produces a chart for each country. “It will then  
open a Word document and export the information into Word just like a  
real author would out of their minds, so to speak, or spreadsheets,”  
he says.

Artificial intelligence researchers say computers are far from being  
what the general public would consider authors.

“There is a continuous spectrum, also known as a slippery slope,  
between a program that automatically typesets a telephone directory  
and a program that generates English texts at the level of variety you  
would expect from a typical human English speaker,” said Chung-chieh  
Shan, an assistant professor in the computer science department of  
Rutgers. “The former program is easy to write, the latter program is  
very difficult; in fact, the holy grail of linguistics. Like Mad-Libs,  
Parker’s programs probably lie somewhere between the two ends of this  
spectrum.”

Mr. Parker has lately taken to lighter fare intended to educate. He  
said he had invested “up to seven figures into the animation business”  
for word-based video games and animated game shows that will teach  
English to non-English speakers. YouTube has many examples of these  
games, which have computer- generated scripts.

A low-tech version of those games are the thousands of crossword  
puzzle books Mr. Parker has made in about 20 languages. The clues are  
in a foreign language and the answers are in English. The computer  
designs the puzzles and ensures that the words become harder as one  
progresses.

As part of his love of words, and dictionaries in all languages, Mr.  
Parker said he has taken to having his computers create acrostic poems  
— where the first letter of a series of words spells a synonym of  
those words, often to ironic effect.

Of course, one of the difficulties of generating a hundred thousand  
poems is stepping back and assessing their quality.

“Do you think one of them is Shakespeare?” he was asked.

“No,” he said. “Only because I haven’t done sonnets yet.”

--

jb



More information about the FoRK mailing list