Sing a Song of Evolution

Is language descended from musical mating calls?

By Jaron Lanier|Tuesday, August 01, 2006
RELATED TAGS: EARLY LANGUAGE

Lngwidge iz a straynge thingee. You can probably read that sentence without much trouble. Sentence also not this time hard. You can screw around quite a bit with both spelling and word order and still be understood. This shouldn't be surprising: Language is flexible enough to evolve into new slang, dialects, and entirely new tongues.

In the 1960s, many early computer scientists hoped that human language was a type of code that could be written down in a neat, compact way, so there was a race to crack that code. If you could decipher it, then a computer ought to be able to speak with people! That approach turned out to be extremely difficult, though. Automatic language translation, for instance, never really took off.

In the last five years or so, computers have gotten so powerful that it has become possible to shift methods. A program can look for correlations in large amounts of text. Even if it isn't possible to capture all the language variations that might appear in the real world (like the oddities I used at the start of this column), a sufficiently huge number of correlations eventually yields results.

For instance, suppose you have a lot of text in two languages, like Chinese and English. If you start searching for sequences of letters or characters that appear in each text under similar circumstances, you can start to build a dictionary of correlations. That can produce significant results, even if the correlations don't always fit perfectly into a rigid organizing principle, like a grammar.

Such brute-force approaches to language translation have been demonstrated by companies like Meaningful Machines, where I was an adviser for a while, and more recently by Google and others. They can be incredibly inefficient, often taking 10,000 times as much computation as older methods, but we have big enough computers these days, so why not put them to work? Set loose on the Internet, such a project could begin to erase language barriers. Even though language translation is unlikely to become perfect, it might get good enough—perhaps within this decade—to make countries and cultures more transparent to one another.

These experiments in linguistic variety could also inspire a better understanding of how language came about in the first place. One of Charles Darwin's beautiful evolutionary speculations was that music might have preceded language. He was interested in the way many species use song for sexual display and wondered if human vocalizations might have started out that way too. Perhaps vocalizations became varied and complex only later, when song was applied to topics other than mating and such basics of survival.

Language might not have entirely escaped its origins. Since you can be understood even when you are not well-spoken, why bother being well-spoken at all? Why is there a magazine editor being paid to improve this text? Perhaps speaking well is still, in part, a form of sexual display. By being well-spoken I show not only that I am a clued-in member of the tribe but also that I am intelligent and likely to be a successful partner and helpful mate.

Only a handful of species, including humans and certain birds, can make a huge and ever-changing variety of sounds. Most animals, including our great-ape relatives, tend to make the same patterns of sound repeatedly. It is reasonable to suppose that an increase in variety of human sounds had to precede, or at least coincide with, the evolution of language. Which leads to another question: What makes the variety of sounds coming from a species increase?

As it happens, there is a well-documented case of song variety growing under controlled circumstances. Kazuo Okanoya of the Riken Institute in Tokyo compared songs between two populations of birds: the wild white-rump munia and its domesticated variant, the Bengalese finch. Over several centuries, bird fanciers bred Bengalese finches, selecting them for appearance only. Something odd happened during that time: Domesticated finches started singing an extreme and evolving variety of songs, quite unlike the wild munia, which has only a limited number of calls. The wild birds do not expand their vocal range even if they are raised in captivity, so the change was at least in part genetic.

The traditional explanation for such a change is that it must provide an advantage in either survival or sexual selection. In this case, though, the finches were well fed, and there were no predators. Meanwhile, mate selection was done by breeders, who were influenced only by feather coloration.

Enter my friend Terry Deacon, a scientist who has made fundamental contributions in widely diverse areas of research. He is a professor of anthropology at the University of California at Berkeley and an expert on the evolution of the brain; he is also interested in the chemical origins of life and the mathematics behind the emergence of complicated structures like language.

Terry proposed an unconventional solution to the mystery of Bengalese finch musicality. What if there are certain traits, including song style, that naturally tend to become less constrained from generation to generation but are normally held in check by selection pressures? If the pressures go away, variation should increase rapidly. Terry suggested that the finches developed a wider song variety not because it provided an advantage but merely because in captivity it became possible.

In the wild, songs probably had to be rigid in order for mates to find each other. Birds born with a genetic predilection for musical innovation most likely would have had trouble mating. Once finches experienced the luxury of assured mating (provided they were visually attractive), their song variety exploded. Brian Ritchie and Simon Kirby of the University of Edinburgh worked with Terry to simulate bird evolution in a computer model, and the idea checked out.

Recent successes using computers to hunt for correlations in giant chunks of text offer a fresh hint that an explosion of variety in song might have been important in human evolution. To see why, compare two popular stories of the beginning of language.

In the first story, a protohuman says his first word for something—maybe ma for "mother"—and teaches it to the rest of the tribe. A few generations later, someone comes up with wa for "water." Eventually the tribe has enough words to have a language.

In the second story, protohumans have become successful enough that more of them are surviving, finding mates, and reproducing. They are making all kinds of weird sounds because evolution allows experimentation to run wild, so long as it doesn't have a negative effect on survival. Meanwhile, the protohumans are doing a lot of things in groups, and their brains start correlating certain distinctive social vocalizations with certain events. Gradually, a large number of approximate words come into use. There is no clear boundary at first between words, phrases, emotional inflection, or any other part of language.

The second story seems more likely to me. Protohumans would have been doing something like what big computers are starting to do now, but with the superior pattern-recognizing capabilities of a brain. While language became richer over time, it never became absolutely precise. The ambiguity continues to this day and allows language to grow and change. We are still living out the second story when we come up with new slang, like bling or LOL.

Even if the second story happened, and is still happening, language has not necessarily become more varied. Rules of speech may have eventually emerged that place restrictions on variety. Maybe those late-arriving rules help us communicate more precisely or just sound sexy and high status, or more likely a little of both. Variety doesn't always have to increase in every way.

Variety could even decrease over time. In fact, there may be a bizarre example of that happening right now in human song. We can easily explore the changing amount of variety in songs over the last hundred years because of an amazing data archive: audio recording. Since the beginning of recorded music, the sound of human song has changed with each new generation of people. There's no confusing a 1930s song with a 1940s song, or a 1950s song with a 1960s song. The pattern sticks until roughly the end of the 1980s. It's not easy to tell whether a song came from 1990 or 2000.

This might sound like an extraordinary claim, but you can test it yourself. Listen to random clips from the many sources of songs available on the Internet and don't peek at the year they were produced. You'll discover that it's harder to date songs from the last two decades than songs from previous decades. Terry and I are now considering this experiment on a more formal basis.

If you accept that there has been a recent decrease in stylistic variety in human song, the next question is "Why?" There are plenty of possibilities: Maybe the Internet makes too much information available, so everyone has the same influences to absorb—and songs lose flavor and take on a generic quality. To be more cynical, it could be a sign of cultural decline.

Another explanation, which is the one I suspect, is that the change since the mid-1980s corresponds with the appearance of digital editing tools for music. Digital tools are more suggestive about results than previous tools: If you deviate from the kind of music a digital tool was designed to make, the tool becomes difficult to use. For instance, it's far more common these days for music to have a clockwork regular beat. Some of the most widely used music software becomes awkward and can even produce glitches if you vary the tempo much while editing. In predigital days, tools also influenced music, but with not nearly such a sharp edge.

So this is an ironic moment in the history of computer science. We are beginning to succeed at using computers to analyze data without the constraints of rigid grammarlike systems. But when we use computers to create, we are confined to equally rigid 1960s models of how information should be structured. The hope that language would be like a computer program has died. Instead, music has changed to become more like a computer program.

Next Page
1 of 2
Comment on this article
ADVERTISEMENT

Discover's Newsletter

Sign up to get the latest science news delivered weekly right to your inbox!

ADVERTISEMENT
ADVERTISEMENT
Collapse bottom bar
DSCDecCover
+

Log in to your account

X
Email address:
Password:
Remember me
Forgot your password?
No problem. Click here to have it emailed to you.

Not registered yet?

Register now for FREE. It takes only a few seconds to complete. Register now »