The Machine That Changed The World — Transcription of the interview with Alan Kay (Part 1)

Tech Life


Introduction

This interview was conducted for The Machine That Changed The World, the same documentary series as the interviews with Larry Tesler and Steve Jobs that I recently transcribed and published here. It took place over two days in July 1990. Only portions of it were featured in the documentary series.

I’ve always been an admirer of Alan Kay and his work. As I was watching this interview I realised there were so, so many things worth taking note of, and worth sharing, that I decided to carry out a full transcription of it. As you can see, it was an energy-draining, time-consuming task, but I’m happy to have done it. There’s so much food for thought here that it’s a veritable banquet.

For a comprehensive look into the series, I recommend checking out the excellent work by Andy Baio in 2008 on his waxy.org website.

About the interview

The video of the interview can be watched here. [Update, March 2022 — The original YouTube link doesn’t work anymore. You can watch the interview here instead. Note also that the video is accompanied by a transcript on the WGBH website, but it doesn’t look accurate in certain places; maybe it is an automated transcription?]

This is the full transcript of the interview. As mentioned above, the interview was recorded over two days, so it’s quite long (about 2 hours and 45 minutes in total). Therefore I thought it was best to split it into two parts, one for each day. This is Part 1.

I’ve applied gentle editing in some places to make the context of certain questions and the meaning of some convoluted passages a bit more understandable. Understanding how the interviewer formulated his questions was sometimes hard, due to the low volume (he didn’t sound as if he was miked) and to the fact that his remarks could be somewhat meandering.

Topics include the evolution of the computer and its role in society (past, present, and future), user interfaces, Doug Engelbart’s famous demonstration, the FLEX machine, the computer as a medium (including parallels with, and excursions on, the evolution of the book), the experience at Xerox PARC, Steve Jobs, Apple, the Macintosh UI in relation to what was pioneered at PARC, the Alto, the Dynabook concept, working with children, the stages of development and the ways to learn about the world, interfaces and application software, the universal machine, the computer interface as user illusion and the concept of virtual machine, the future of computing, virtual reality, and general considerations on the evolution and the revolution of the computer. Summarising these is a bit difficult, because all topics keep surfacing and returning throughout the whole flow of the conversation.

Disclaimer: I have done this transcription work and chosen to publish it here in good faith, for educational purposes. I don’t make any money from my website, as it is completely ad-free. In any case, should any copyright holder contact me requesting the removal of the following material, I will certainly comply.

Enjoy the conversation.


 

The interview - Day 1

 

Interviewer: Alan, the first computers were large expensive machines that had been built to do calculations. How did people think about this new technology?

Alan Kay: Well, I think that some people thought they were never going to get any smaller. There were calculations about how many Empire State buildings you needed full of vacuum tubes to do this and that. And other people at Bell Labs who had been working on the transistor had a sense that this could be used as a switch, because AT&T wanted something to replace vacuum tubes. And so I think there are several perceptions of how big the thing was going to be. I think it wasn’t until the 1960s that people started thinking that computers could get quite small.

Int: Part of the problem was the size; the other problem was just what it was supposed to be for, this machine.

AK: Yeah, I think that when they were doing ballistic calculations, there was this notion that you would run a program for a very long time. It was really after UNIVAC in the 1950s that people started thinking, “Oh, we might have to do some programming.” And people — most notably Grace Hopper very early on — started thinking about higher-level programming languages. So that was sort of one flow. And I think the other thing was, the urgency of the Cold War got people thinking of air defence. And so scientists at MIT started working on systems that could assess radar and show it to you on a display. And those are the first computers that had displays and pointing devices.

Int: For the most part, though, those were kind of exceptions. Can you describe what the batch processing experience was like, what it would have been like?

AK: Well, I always thought of it as something like a factory. You have this very large thing, and there are raw ingredients going in the front, and there are some very complex processes that have to be coordinated, then sometime later you get what you were hoping for out of the rear end. And you can think of them also as like a railroad: somebody else is deciding the schedules. I can certainly remember having maybe one run a day, or two runs a day, and you could only get your results then, and you had to work around it, and so forth. And so it was very much of an institutional way of thinking about computing.

Int: Did this put many people off computing?

AK: I don’t think so. I think, you know, happiness is… how much the reality exceeds expectations. I think that most people were happy… In the 1950s, business computing was predominantly done with punch card machines. And IBM was the company that made those punch card machines. So there was an enormous disparity between what you could do with computers and what most people did.

Int: But still for most people the physical size and appearance of the machine was a compelling thing, and that gave it this mythology?

AK: Well, my feeling is that anything that is larger than human scale invokes mechanisms concerned with religion, so you have a priesthood with white coats, you know, all the paraphernalia were there, and some people thought it would take over the world, and some people wanted it to take over the world, and some people were afraid it would take over the world, and none of those things happened. But that was what it was when it was this large thing and towards the late 1950s many people in different places started saying, “Well, this thing has the potential of being more like a partner, maybe a complementary partner. It can do some things that we can’t do well, and vice-versa, so we should find some way of bringing it down to human scale”.

Int: The complement to the individual?

AK: The partner to people, not so much directly connected to the institutions. Although, of course, the first way it was done using time-sharing, the mainframe was still owned by the institution.

Int: Okay now, of the people, of these sort of very early visionaries — I mean, you’ve divided them into two groups, haven’t you?

AK: Yeah.

Int: Could you talk a bit about that?

AK: Which two groups? Give me a clue. [laughs]

Int: Well, one group was the ones who were interested in amplifying intelligence…

AK: There are lots of different groups… So, well, I think in the late 1950s there were a variety of different conceptions of what you should do with it. Some of those conceptions were had by the very same person. So for instance, John McCarthy, who was a professor at MIT, [at the same time] wrote memos suggesting we should time-share the computer, and he also thought [that] more into the future we’d be all networked together, and there would be these huge information utilities — it would be like our power and lighting utilities — that would give us the entire wealth of the knowledge of man. And he suggested that for that we’d have to have something like an intelligent agent. An entity, not maybe as smart as us, but an expert in finding things, and we would have to give it advice. He called it the advice-taker.

Int: Tell me about time-sharing, because obviously one thing people would say is, “How could everybody have their own computer?”

AK: Well, I think that was it. They cost several million dollars back then, and people had been fooling around with things [that were a] little bit like time-sharing before the official invention of it at MIT. And part of it was just to debug programs. The idea is that debugging is one of the most painful processes. Incidentally you probably know that there was actually a bug. The first bug was a moth, I think, found by Grace Hopper. […]

Int: Tell me about the basic concept of time-sharing. What was the idea?

AK: The idea was that when humans use computers, the best way of having them use it is, instead of using all the power for five minutes a day, [you] take those five minutes of computer power and spread it out over several hours. ‘Cause people are slow at typing, and when you’re debugging you don’t need to do long runs of things. And what you want is lots of interaction, lots of finding out that you’re wrong, and this is wrong, you can make a little change, and this is wrong, etc. So you want to go from something that requires enormous planning to something that is much more incremental. And this is the urge that the people who developed time-sharing had.

Int: But how could you give an individual exclusive access to a computer?

AK: Well, you couldn’t. So the idea is that if the thing cost several million dollars, as McCarthy pointed out, one of the things that you could do is roll in one person’s job and do it for a couple of seconds, then roll in another person’s job and do it for a couple of seconds. And if you had a fast-enough disk for holding all of these jobs, then you would be able to handle 20–30 or more people at once, and they wouldn’t feel that the lag of a few seconds was going to hurt them.

Int: So the computer is switching between the jobs so fast, is that the way it works?

AK: Right, right. Well, that’s the way it’s supposed to work. And of course, the thing that drove personal computing in the 1960s into existence was that it proved to be extremely difficult to get reliable response time.
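[Aside — to make the time-slicing idea concrete, here is a minimal round-robin sketch of my own in Python; it is an illustration, not anything shown in the interview. Each user’s job is rolled in for a short, fixed slice and then rolled back out, so everyone gets frequent small turns instead of waiting for one long batch run.]

    # Toy round-robin time-sharing: one processor, many users, short slices.
    from collections import deque

    def run_timesharing(jobs, quantum=2):
        """jobs maps user -> seconds of work needed; returns finish time per user."""
        clock = 0
        finished = {}
        queue = deque(jobs.items())
        while queue:
            user, remaining = queue.popleft()
            used = min(quantum, remaining)       # roll the job in for a couple of seconds
            clock += used
            remaining -= used
            if remaining > 0:
                queue.append((user, remaining))  # roll it back out; next user's turn
            else:
                finished[user] = clock
        return finished

    print(run_timesharing({"ada": 5, "bob": 3, "carol": 4}))
    # Each user sees progress every few seconds rather than once or twice a day.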

Int: But this is like one of the early ideas of the user having an illusion.

AK: Yes, indeed. In fact, the best one of these was a system called JOSS at RAND Corporation in which the system was devoted entirely to running this single language. That made things much better. The language was designed for end users. It was the first thing that affected people the way spreadsheets do today. It was designed— it had eight users on this 1950s machine, but the guy who did it, Cliff Shaw, was a real artist, and the feeling of using this system was unlike that of using anything else on a computer. And people who used that system thought of wanting all of their computing to have the same kind of æsthetic, warm feeling that you had when using JOSS.

Int: If you had to list — if you were a very clear-thinking person back then in the late 1950s, who knew everything they wanted computing to be, what would they have: small, interactive, real-time…?

AK: You know, these things go in waves. The emphasis in the 1950s was first on being able to do things incrementally; second, people wanted to share, so the idea of electronic mail was very early at MIT, and at some point people started thinking that the form of interaction started to have a lot to do with how good you felt, and how puzzled you were, and so forth. The invention of computer graphics, modern computer graphics, by Ivan Sutherland in 1962 had a lot to do with people’s perceptions. ‘Cause once you saw that, you couldn’t go back. It established a whole new way of thinking about computer interaction, and to this day it has remained an inspiration.

Int: Tell me about that piece of work. He [Sutherland] was a graduate student, wasn’t he?

AK: Yes, he had gone to Carnegie Tech, now Carnegie Mellon, and came to MIT to do his Ph.D. work. One of his advisors was Claude Shannon, and another one was Marvin Minsky. And as he has told the story several times, they were then in just the early stages of Jack Licklider’s dream to have the computer be a sort of a symbiotic partner. And when he went out to Lincoln Labs, people were starting to think about that, and there was a marvellous machine there called the TX‑2. This was one of the last computers in the US large enough to have its own roof, you know, it was one of these enormous machines originally built for the air defence system. And Ivan made friends with people there and started thinking about something having to do with computer graphics.

The air defence system used displays for putting up radar plots, and so forth. And light guns were already around, and so forth. And so he started thinking about maybe doing a drafting system, and, as I recall it, one of his original reactions when seeing the kind of graphics you could put on a screen… (Because the screens couldn’t even draw lines. When they put up a line, it was put up with lots of individual dots. And done fairly slowly, so it would flicker and it was pretty awful-looking.) …Ivan at that point said the best words a computer scientist can ever say, which are, “What else can it do?” And so he got… in fact having the display not be great helped what he tried to do on it, because he started thinking of what was the actual power and value of the computer. Now, today we have a problem because the displays are good, so our tendency is to simulate paper. But what Ivan started thinking about is, what kind of value could the computer bring to the transaction so it would be even worthwhile to sit down and use such an awful-looking display.

And the thing he came out with was that the computer could help complete the drawings, so you could sketch — this is where the idea of Sketchpad [comes from] — you could sketch something in. If you were trying to make a flange, you just put in the lines for the flange, and then you would tell Sketchpad to make all of these angles right angles. And you could make these two things proportional to each other, and so forth. And Sketchpad would solve the problem and straighten out the drawing in front of your eyes into something that was more like what you wanted. So that was a terrific idea.

And then he took it another step further, because he realised he could solve real-world problems. So you put a bridge into Sketchpad: it’d never been told about bridges before, but you could put bridges in and tell Sketchpad about pulling and pushing of things, and hang a weight on the bridge, and Sketchpad would generate the stresses and strains on the bridge. So it was now acting as a simulation. You put in an electric circuit. Sketchpad had never heard about electric circuits before, but you could put in Ohm’s law, and what batteries do, and it would, in order to settle the constraints… — Or one of the nicest things was, you could put in mechanical linkages. So you could do something like a reciprocating arm, like on a locomotive wheel; for going from reciprocating to circular motion. And Sketchpad’s problem solver, if it had a problem that it couldn’t get an exact solution for, or there wasn’t just one solution — and of course there isn’t for this [makes a rotating motion with his hands] — what it would do is iterate through all the solutions, so it would actually animate this thing. And right on the screen you would see this thing animating, it was the very thing that you were thinking of.

Int: And this is very early, it sounds very modern.

AK: 1962, yeah. Well, you can’t buy a system today that does all the things that Sketchpad could back then. That’s what’s really amazing. It [was] the first system that had a window, the first system that had icons. Certainly the first system to do all of its interaction through the display itself. And for a small number of people in this community — the Advanced Research Projects Agency research community — this system was like seeing a glimpse of heaven. Because it had all of the kinds of things that the computer seemed to promise in the 1950s. And practically everything that was done in the 1960s in that community, and into the 1970s, had this enormous shadow cast by Sketchpad, or you could maybe think of it better as a light that was sort of showing us the way.

Int: And that’s one glimpse of heaven. Another glimpse of heaven I suppose is the work of Doug Engelbart.

AK: Yes, and he was also funded by the Advanced Research Projects Agency, and his original proposal I think was also 1962. 1962 is one of those amazing years. And Engelbart had read about the Memex device (described by) Vannevar Bush, who was President Roosevelt’s science advisor and a former professor at MIT; he had written an article in 1945 called As We May Think. Most of the article was devoted to predictions of the future, and one of them was, he said that sometime in the future we’ll have in our home a thing like a desk, and inside the desk on optical storage will be the contents of a small town library, like 5,000 books. There’ll be multiple screens, there’ll be pointing devices, and ways of getting information in, and pointing to things that are there. And he said that you can form trails that will connect one piece of information to another. He invented a profession called “Pathfinding”: there’d be people called Pathfinders who sold paths. You could buy a path that would connect some set of interesting things to some other set of interesting things.

And so this is a complete vision in 1945. And a lot of people read that. I read it in the 1950s when I was a teenager because I had seen it referred to in a science fiction story. Engelbart had read it fairly early, when he was in military service. And once you read that thing you couldn’t get it out of your mind, because it was the thing that anybody who deals with knowledge would desperately like to have. And so Engelbart in the early 1960s started writing proposals, and he finally got ARPA interested in funding it, and they started building the system he had proposed. And a couple of years later, 1964, he invented the mouse — to have both a better pointing device than the light pen, and a cheaper one. And they built a system that by 1968 was able to give a very large-scale demonstration, to 3,000 people in San Francisco.

Int: And you were there.

AK: I was there.

Int: Tell me about it.

AK: Well I had seen the system beforehand, because of course I was a graduate student in this community. But still, even having seen the system, the scale of the demo and the impact it had, it was unbelievable. I remember it started off, there were about 3,000 people in this auditorium. It was at the Fall Joint Computer Conference, I think, and all you could see on the stage was this figure with something in his lap and a box in front of him and a couple of things that looked like TV cameras around him. And he had on a headset and he started talking, he said, “I’d like to welcome you to our demonstration.” All of a sudden his face appeared 20×30 ft wide on this enormous screen, because they borrowed one of NASA’s… one of the first video projectors. They used this huge situation display, and then they used video editing so you could see while he was giving this demonstration what he was doing with his hands, with the mouse and the key set, what was going on the screen, and so forth.

Int: You were talking about Doug. What did he demonstrate, what sort of things did he show you?

AK: Douglas Engelbart started off just showing us how you could point to things on the screen and indicate them, and started off actually fairly simply — just showing how you could look at information in various ways. He showed something very much like HyperCard, so he had a little map of how he was going to go home that night, how he was going to go to the library and the grocery store and the drugstore, and so forth. You could click on each one of those things and it would pop up and show him what he had to get there. And what he demonstrated were the early stages of what we call hypertext today. Lots of response in the system. One of the big shockers was, midway through the thing he started doing some collaborative work — and all of a sudden, in an insert on the screen, you saw the picture of Bill Paxton who was 45 miles away down in Menlo Park, live. Both of them had their mouse pointers on the screen, and they were actually doing the kinds of things that people still dream about today.

So this was a complete vision. And I think of it as the vision of what we today call personal computing or desktop computing. Except for the fact that the processor was a big time-shared computer, all of the paraphernalia… — Engelbart used a black and white display, 19-inch display, using video to blow up a calligraphic screen, and had a mouse. If you looked at the thing today, you’d see something that looked like somebody’s office that you could walk into. And what remained was to do something about the problems in response time and all that stuff. And that was something that I had gotten interested in a few years before. And the first machine that I did was a thing called the FLEX Machine and…

Int: Just before we go into FLEX, tell me what was the reaction like among the computer community?

AK: […] ARPA had spent something like $175,000 on this demo and everybody in the ARPA community wanted to show up. I recall that the crowd, you know, [Doug] got a standing ovation and he won the best paper at the Fall Joint Computer Conference and so forth. And what was even better is that he had brought up four or five terminals to the system and had them in a room, and people could go in and actually learn to interact with the system a bit. So it was a large-scale demonstration. I don’t think that anybody has ever traced what people did over the next 15 or 20 years as a result of having been at that demonstration. That would be interesting.

Int: Doug thought– hoped it would change the face of computing and is very disappointed that it didn’t. He thinks it didn’t really have much impact.

AK: Well, I mean, we thought of Doug as Moses opening the Red Sea. You know, he was like a biblical prophet. And like a biblical prophet he believed very much in his own particular vision. And that vision was not a 100% good idea. One of the things that they neglected completely was the learnability of the system. People who used the system were all computer experts who loved it, were willing to memorise hundreds of commands in order to be… If you memorised hundreds of commands and you learned how to use the key set, you could fly through this n‑dimensional space. It was quite exhilarating to watch people doing this and exhilarating to learn how to do it. The problem with it, though, was that there were so many barriers towards learning. And there were many other things that were problems. It wasn’t a particularly good simulation of paper, partly because he didn’t want it to be. And so the idea that there would be a transition period where you would be producing documents — of course, they printed documents out, but there was no notion of desktop publishing there. The whole system was rather like a violin. And if you were willing to learn how to become a violinist you could play incredible music through the thing. And of course all of us were so completely sold on this system in the late 1960s. And the first machine that I did, the FLEX Machine, was an echo of this.

Int: Now, the FLEX Machine. The other element which is in this, apart from these fantastic software achievements of Sutherland and Engelbart, is of course the size of the machine. There were some precedents, weren’t there?

AK: Oh yes. The first personal computer, in my opinion, was the machine called the LINC. If you include size as one of the important things. Of course, you could say that the Whirlwind was a personal computer at MIT, or the TX‑2. Some people tried to get Ivan Sutherland to write a paper called When There Was Only One Personal Computer. And that was him using Sketchpad on the TX‑2, which is this thing bigger than a house. But in 1962 Wes Clark did a machine in which part of the design parameter was [that] it was small enough to look over when you’re sitting down at a desk. So, it was not supposed to loom over you, it was something you could actually see over it. And many important things were done on that machine. In fact, quite a few hundred, if not a few thousand of them were built and used in the biomedical community. It was designed for biomedical research, designed to be programmed by its users who were not computer scientists, even designed to be built by non-computer scientists. They used to have gatherings in the summertime where 30 people or so would come and build their own LINCs and then take them back, and stuff.

It was a great little machine. It had a little display, and display editors, and so forth. And so it was something that you could point to when you were trying to think of what to do. And there were other small things. There was a machine called the IBM 1130, which was really an abortion of a machine. It was sort of a keypunch keyboard hooked to one of the early removable disc packs. I mean this was a mess of a machine, but it was the size of a desk and you could sit down. It wasn’t designed really to be programmed except by storing programs on cartridge. It was very funny, you could only store data on the disc. IBM forced you to put the programs on punchcards and that was the only way you could feed them in, it was really hilarious. So there were lots of different kinds of things like that.

Int: What were you trying to do with FLEX?

AK: I worked on this machine with a guy by the name of Ed Cheadle who was trying to– really trying to invent what today we would call personal computing. And he had a little machine and he had a little Sony television set, and what he wanted was something for engineers (he was an engineer) that would allow them to flexibly do calculations beyond the kinds of things that you do with a calculator. So you should be able to program the machine. You should be able to store the programs away. You should be able to get it to do things. And then I sort of came and corrupted the design by wanting it to be for people other than engineers. I’d seen JOSS, and I’d also recently seen one of the first object-oriented programming languages, and I realised how important that could be. And then Cheadle and I escalated — the problem is that he and I got along really well, and so we escalated this design beyond the means of the time to build it practically.

But we did build one, and it had many things that people associate with personal computing today: it fit on top of a desk (special desks, because it weighed hundreds of pounds). But it had a fan that sounded like a 747 taking off, because the integrated circuits back then had maybe 8 or 16 gates on a chip. So this thing had about 700 chips in it. It had a high-resolution, calligraphic display. It had a tablet on it, and it had a user interface that included multiple windows, things like icons and stuff. But it was rather like trying to assemble a meal, maybe make an apple pie from random things that you find in the kitchen — like no flour, so you grind up Cheerios, you know? [laughs] — You wind up with this thing that looks sort of like an apple pie, but actually it isn’t very palatable. So the result of this machine was a technological success and a sociological disaster. And it was the magnitude of the rejection by the non-computer people we tried it on that got me thinking about user interface for the first time. And I realised that what Cliff Shaw had done in JOSS was not a luxury, but a necessity. And so it led to other ways of looking at things.

Int: Originally you were saying people thought of mainframes as like factories, right? These early attempts at personal computers, are what, like [Ford] Model T…?

AK: Yeah, I think one of the ways that we commonly thought about Engelbart’s stuff was [that] he was trying to be Henry Ford. That you could think of the computer as a railroad, and the liberating thing for a railroad is the personal automobile. And Engelbart thought of what you were doing on his system as travelling, so you’re moving around from link to link in the hyperspace. And he used terms like thought vectors and concept space, and stuff. Nobody knew what it meant, I’m not sure he did either, but it was that kind of a metaphor.

Int: What’s wrong with that metaphor, ultimately?

AK: Well, I don’t think there is anything particularly wrong with it, but when you’re doing things by analogy, you always want to try and pick the right analogy because there’s so many ways of making them. And the thing that was limiting about it, when you apply it to humanity, as an example, is that a car you expect to take a few months to learn how to [drive], and that was certainly true. It’s something that doesn’t extend into the world of the child. There’s a whole bunch of things — but of course we didn’t think of it that way. We thought of the car as one of the great things of the 20th Century, and it changed our society and everything. So, we were definitely using that as a metaphor.

And in fact the thing that changed my mind had nothing to do with rejecting the car as a metaphor: it was finding a better metaphor, one that completely possessed me. And that came about from seeing quite a different system. I had called the FLEX machine a personal computer. I think that was the first use of that term. While I was trying to figure out what was wrong with it, I happened to visit RAND Corporation over here in Santa Monica and saw sort of a follow-on system to JOSS that they had done for their end users who were people like RAND economists. These people loved JOSS but they hated to type. And so, in the same year the mouse was invented, the RAND people had invented the first really good tablet.

It was a high-resolution thing. And they decided that the thing to do was to get rid of keyboards entirely, and so the first really good hand-character recogniser was developed there. And they built an entire system out of it called GRAIL, for GRAphical Input Language. So there’s no keyboard at all. You interacted directly with the things on the screen. You could move them around. If you drew a square on the screen, it recognised you were trying to draw a square, it would make one. If you put in your hand-printed characters, it would recognise them and straighten them up. The system was designed for building simulations of the kinds that economists and other scientists would like to build. And using this system was completely different from using the Engelbart system. And this system, it felt like you were sinking your fingers right through the glass of the display and touching the information structures directly inside. And if what Engelbart was doing was the dawn of personal computing, what the RAND people were doing was the dawn of intimate computing.

Int: And in intimate computing you forget that it’s a machine and you think of it more as a medium.

AK: Yeah. One of the things that completely took hold of me in using the GRAIL system was it felt more like a musical instrument than anything, because a musical instrument is something that– most musicians don’t think of their instruments as machines. And it’s that closeness of contact, the fitness, that you’re directly dealing with content rather than the form of the content, that possessed me very strongly. That was in 1968 as well, and I saw another several things. I saw Seymour Papert’s early work with LOGO: here were children writing programs — and that happened because they had taken great care to try and combine the power of the computer with an easy-to-use language. In fact they used the RAND JOSS as a model, and used the power of LISP (which had been developed a few years before as an artificial intelligence language), put them together, and that was the early LOGO. And to see children confidently programming just blew out the whole notion of the automobile metaphor.

And the thing that replaced it was that this is a medium. This is like pencil and paper. We can’t wait until the kids are seniors in high school to give them driver’s ed on the thing: they have to start using it practically from birth the way they use pencil and paper. And it was destined not to be packaged on the desktop, because we don’t carry our desks with us. It had to be something much smaller. And that was when I first started seriously thinking about a notebook computer. And of course the first thing I wanted to know after deciding that it had to be no larger than this [uses his hands to convey the size of a large book], was when would that be possible, if ever?

So I started looking at what the integrated circuit people were doing — Gordon Moore and Bob Noyce, etc. — and there were these nice projections, that they as confident physicists had made, about where silicon can go. And what’s wonderful is, these projections have only been off by a few percent now, more than 20 years later. And of course I was very enthusiastic, I would believe anything that was in the right direction. So I took this hook, line, and sinker and said, “Okay, 20 years from now we’ll be able to have a notebook-sized computer on which we can not only do all the wonderful things computers can do, but also mundane things, because that’s what paper is all about. You can write a Shakespearean sonnet on it, but you can also put your grocery list on it”. So one of the questions I asked back in 1968 is, “What kind of a computer would it have to be for you to do something so mundane as to put your grocery list on it, be willing to carry it into a supermarket, and be willing to carry it out with a couple of bags of groceries?” There is nothing special about that. You can do it with paper. So the question is… see the question is not whether you replace paper or not. The question is whether you can cover the old medium with the new. And then you have all these marvellous things that you can do with the new medium that you can’t do with the old.

Int: Many people find the idea of a medium a tricky concept. In the sense you were talking about writing and so forth or music, what do you mean?

AK: Well, I think most of my thoughts about media were shaped by reading McLuhan, which not everybody agrees with, but one of his points is that the notion of intermediary is something that is not added on to humans. It’s sort of what we are. We deal with the world through intermediaries. We can’t fit the world into our brain. We don’t have a one-to-one representation of the world in our brain. We have something that is an abstraction from it, and so the very representations that our mentality uses to deal with the world are intermediaries. So we kind of live in a waking hallucination. We have language as an intermediary. We have clothes as an intermediary. So this whole notion of what we think of as technology could also be replaced by the word medium. And I think — even though media has this connotation of, you know, the news business and stuff like that — I think it’s an awfully good word because it gives us this notion of something being between us and direct experience.

Int: Now, of those media, writing and printing (obviously as media we can all think about) have great power, and music as well. What does that mainly consist of?

AK: I think that the trade-off with using any kind of intermediary is that every time you put something between you and direct experience you’re alienating a bit. And what you hope to get back from it is some kind of amplification. Various people who have studied evolution talk about the difference between the direct experience you get from kinesthetically touching something to the indirect experience you get of seeing it. And one is less involving than the other. The seeing is a good one because it means you don’t have to test out every cliff by walking over it. And so there’s an enormous survival value about stepping back. To have a brain that can plan and envision things that might happen, whether as images or even more strongly in terms of symbols, is of tremendous survival value because it allows you to try out many more alternatives. And so, as the existentialists of this century have pointed out, we have gained our power over the world at the cost of alienating ourselves.

Int: …A computer is anything but a machine in a certain way. Why should that be a medium?

AK: Well, I think machine is a word that has a connotation that’s unfortunate. Yeah, maybe what we should do is either elevate what we think of when you say ‘machine’, you know, mechanism kind of thing, or maybe use a different word. To most scientists, machines are not a dirty word. Because one of the ways we think about things — even in biology these days — is that what we are is an immense, immensely complicated mechanism. That’s not to say that it’s predictable in the sense of free will, because there’s this notion now, [which] people are familiar with, that’s called chaotic systems, systems that are somewhat unstable. There’s a lot of instability and stuff built in. It’s a little more stochastic in a way, but the idea that there are things operating against each other, and they make larger things, and those things are part of larger things, and so forth, to most scientists it seems like a beautiful thing. It’s not a derogatory term.

Int: I think it’s also that most machines were built to do useful work, but this machine in processing information has properties which are sort of linguistic as well as…

AK: Yeah, I think the connotation of machine is something that has to do with the physical world. Most people haven’t learned mathematics, have never encountered the notion of an algorithm, which is a process for producing– you could think of it as moving around physical things called symbols. I mean something physical is always being moved around, that’s the way scientists look at the world. And the real question is, “Is this a trivial example of it, or is this an unbelievably interesting example of it?” So a flower is a very interesting example of a machine, and a lever is a somewhat trivial and more understandable notion of a machine.

Int: Now these machines can move around symbols presumably — like the electronic voltages or whatever — so fast that they can dynamically simulate virtually anything you want…

AK: Lots of things, yeah. One way to think of it is, a lot of what a computer is, is markings. And it doesn’t matter whether you make the markings for a book on metal or paper, or you make them in clay, [these are] all kinds of ways of making the equivalent of a book. The main thing about a marking is you have to have some way of distinguishing from other markings, and once you’ve got that then they can act as carriers of different kinds of descriptions. And the range of the markings that you can have in a book, and the range of markings you can have in a computer are the same.

Int: So, like, in a computer it started with punch cards.

AK: Yeah, it doesn’t matter what it is. There’s a wonderful computer built out of Tinkertoy in the Computer Museum in Boston. It was done by some MIT grad students. You can make it out of anything. So that’s one set of ideas, that is, the representational properties of things are very very much like a book, but then the computer has this even more wonderful thing, that it’s a book that can read and write itself. And moreover it can do it very quickly. So it has this self-reflexivity that you usually only find in biological systems. And it can carry through these descriptions very, very rapidly. And then you get… that’s the big nirvana experience, where you suddenly realise, “Holy smokes! This thing is a pocket universe”. And it has a nice complementary role to the way we deal with our physical universe as scientists. The physical universe, especially in the 19th Century, was thought to have been put there by God, and it was the job of scientists to uncover this glorious mechanism. So in the computer what you do is, you start off with a theory, and the computer will bring that theory to life, so you can have a theory of the universe that is different than our own, like it has an inverse cube law of gravity. And you can sit down and in not too much time you can program up a simulation of that, and discover right away that you don’t get orbits anymore with planets. And that’s curious, and a little more thinking and deeper delving will soon make you discover that only the inverse square law will actually give you orbits.
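[Aside — Kay’s “pocket universe” example is easy to try today. The sketch below is my own illustration in Python, not code from the interview: the same planet is integrated once under an inverse-square attraction and once under an inverse-cube one. Under the inverse-square law its distance from the sun stays bounded (it keeps orbiting); with the inverse-cube law the same starting conditions send it drifting away, which is the “no orbits anymore” result Kay describes.]

    # Semi-implicit Euler integration of a planet around a sun fixed at the origin.
    def simulate(power, steps=20000, dt=0.001):
        """Return the (min, max) distance from the sun seen during the run."""
        x, y = 1.0, 0.0          # start at unit distance
        vx, vy = 0.0, 1.1        # a bit faster than circular-orbit speed
        r_min = r_max = 1.0
        for _ in range(steps):
            r = (x * x + y * y) ** 0.5
            a = -1.0 / r ** power            # attraction proportional to 1/r^power
            vx += a * (x / r) * dt           # update velocity first...
            vy += a * (y / r) * dt
            x += vx * dt                     # ...then position with the new velocity
            y += vy * dt
            r_min, r_max = min(r_min, r), max(r_max, r)
        return r_min, r_max

    for p, label in [(2, "inverse-square"), (3, "inverse-cube")]:
        lo, hi = simulate(p)
        print(f"{label}: distance stays within [{lo:.2f}, {hi:.2f}]")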

Int: That’s curious, but the other way you put it once is you could invent a universe with no gravity, right?

AK: Yes, and in fact it’s again the difference between a physical book, which you can drop and it will fall down to the floor, and the story that’s in the book that could be about a place that has no gravity, like out in space. And the computer is at once a physical thing, and the stories that it holds don’t have to have anything to do with the physical universe its components reside in. So, in other words, you can lie, both with books and with computers. And it’s important, because if you couldn’t lie you couldn’t get into the future. Because we are living in a world of things that were complete lies 500 years ago.

Int: Now, the thing about the computer, apart from being dynamic, [is that] its ability to simulate means that it can embody all other media. It is the first meta-medium, isn’t it?

AK: Yes, yes, I called it the first meta-medium in the early 1970s when I was trying to explain what it was as being distinct from big mainframes with whirling tape drives. The physical media that we have, like paper, and the way it takes on markings are fairly easy to simulate. The higher the resolution display you get, the more you can make things look like a high-resolution television camera looking at paper. That was one of the things that we were interested in at Xerox PARC — to do that kind of simulation. And there are deeper simulations than the simulation of physical media. There’s the simulation of physical processes as well.

Int: Now, this is pretty mind-blowing, because the computer then is not just important as a versatile tool, a new machine. What we’re talking about is something of the gravity of the invention of writing or printing. In popular terms, we’re talking about something really pretty fantastic.

AK: Right. It’s sort of sneaking its way in the way the book did. The Catholic Church thought that the book might be a way of getting more bibles written in Latin, and so by the time they started thinking about suppressing it, it was too late because, of course, you could print all these other things. And all of a sudden you go from having 300 or 400 books in the Vatican Library in the year 1400, to something 100–150 years later where there are some 40,000 different books in circulation in Europe, and 80% of the population could read them. And so all of a sudden you have something totally at odds with most ways of dealing with religion, which is multiple points of view.

Int: What can we learn, looking at the history of the book? Clearly it took a lot longer to happen, for one thing. But what were the stages it had to go through to get to that point?

AK: Well, I think in doing these analogies, analogies can be suggestive, you know. And of course, the amount that analogy carries through from one place to another depends on lots of things. I happen to like the analogy to the book quite a bit, because you have several stages. You have the invention of writing, which is a huge idea, incredibly important. And the difference between having it and not having it is enormous. The difference between having computers — even if they’re big, just big mainframes — and not having them is enormous. There are just things you cannot do without them.

Then the next stage was the Gutenberg stage, and just as McLuhan liked to say, “When a new medium comes along it imitates the old”, so Gutenberg’s books were the same size as these beautiful illuminated manuscripts that were done by hand. And big like this [makes a gesture with his hands to indicate the size] and in the libraries of the day, back then, the books generally weren’t shelved. There were so few books in a given library that they actually had their own reading table. And if you look at woodcuts of the day, it looks for all the world like a time-sharing bullpen. There’s one library in Florence that’s set up that way. You go over to the table that the particular book is at, they were chained in because they were too valuable, they were priceless.

And Gutenberg imitated that, but of course he could produce many more books. But he didn’t know what size they should be, and it was some decades later that Aldus Manutius, a Venetian publisher, decided that books should be this size [makes a gesture with his hands to indicate the size]. He decided they should be this size because that was the size that saddle bags were in Venice in the late 1400s. And the key idea that he had was that books could now be lost. And because books could now be lost, they could now be taken with you. And they couldn’t be like this [makes a gesture with his hands to indicate a huge book], they had to be something that was a portable size. And I think in very important respects where we are today is before Aldus. Because this notion that a computer can be lost is not one that we like yet. You know, we protect our computers. We bolt them to the desk and so forth.

Int: Because they’re still quite valuable.

AK: Still quite expensive. They’ll be really valuable when we can lose them.

Int: The other point about… the point of literacy, is also well taken there. While there are no books to read, I presume there’s no point in it…

AK: Right, well, if reading and writing is a profession, you don’t have the notion of literacy and illiteracy. Like, there’s no word in our language that stands for ill-medicine. There is medicine, which is practiced by professionals, and there is no notion of ill-medicine. Now if we all learned medicine — and it may be some day that staying healthy might be something that becomes an important part of everyone’s life — then there’ll be a notion of medicine and ill-medicine. And so it wasn’t until there was a potential for the masses to read, that we had the notion of literacy and illiteracy. And to me literacy has three major parts that you have to contend with. One is, you have to have a skill to access material prepared for you by somebody else, regardless of what it is that you’re trying to be literate in. In print medium it’s called reading, reading/accessing. I don’t think many people in our society would think that a person who just could read was literate. Because you have to be able to create ideas and put them into the information stream, so you have creative literacy, which is the equivalent of writing.

And then finally, after a medium has been around for a while, you have the problem that there are different genres of these things, so that you have to go into a different gear to read a Shakespearean play. Now there’s a different way of doing it. The way they wrote essays 200 years ago is different than the way they write essays today, and so you have to have some sort of literacy in the different styles for representing things. And so I think you have those three things when something new comes along; since we’re talking about media, we could talk about mediacy and ill-mediacy, or computeracy and ill-computeracy, or something like that. And I think each one of those things is a problem that has to be solved. There’s the problem of, when you get something new made for a computer, how can you be able to do the equivalent of [reading] it, of [accessing] what it is? You go to a computer store and get some new wonderful thing, slap it into your computer, and if you have to learn a new language in order to use the thing, then things are not set up for you being computer literate.

Int: So writing in Latin would be analogous to rather difficult…

AK: Yeah, I think it has a lot to do with that; and again, to sort of strain the analogy we’re making with printing, my other hero back then was Martin Luther, who a few years later had this intense desire to let people have a personal relationship with God without going through the intermediaries of the church. And he wanted people to be able to read the bible in their own language, and knew that German — which was a bunch of different regional dialects back then, mostly for talking about farming and stuff — was not going to be able to carry the weight of the bible as it was portrayed in Latin, which is a much more robust and mature language. So Martin Luther had to invent the language we today call High German, before he could translate the bible into it. And so the language that Germans speak today is in a very large part the invention of a single person, who did it as a user interface, so that he could bring the media much closer to the people, rather than trying to get everybody in Germany to learn Latin.

Int: Now with that sort of insight, we could think of this whole process… it literally took a long time, took many centuries; [while, as] you say, the early phases of the computer have gone much more rapidly, haven’t they?

AK: Right, things are going more rapidly. But also to the extent that analogies help, certainly that progression… I happen to be a big fan of books, and I happen to know that progression of illuminated manuscripts, which I absolutely adored when I was a child. The Gutenberg story; the Aldus story; he was one of my great heroes, because he was the guy who decided you had to read more than one book. And then Martin Luther. Those immediately sprang to mind as soon as I had decided that the computer wasn’t a car anymore. And then the two most powerful things that you have to do right away become clear. You don’t have to wait to gradually figure them out. You have to get the thing small, and you have to find a language that will bring what it is closer to the people rather than the other way around.

Int: Now an opportunity came for you, I know you were going to make your Dynabook one way or another, but an opportunity came for you to get to a brand new centre when you went to Xerox PARC.

AK: Yeah, I thought up the Dynabook in 1968 and made a cardboard model of it, and a couple of years later I was about to go to Carnegie Mellon University to do the thing, and I had been consulting a little bit for the newly formed Xerox PARC, and it just got to be too much fun, so I never wound up going to Pittsburgh.

Int: What was Xerox PARC?

AK: Xerox PARC was an attempt by Xerox to spend part of its research and development money on really far-out stuff that was not going to be particularly controlled by them. In the late 1960s they bought themselves a computer company. They bought a whole bunch of publishers. They were trying to get into a larger sphere than just office copiers. One of the ringing phrases of the late 1960s was that Xerox should be “the architects of information”. So they had this expansive feeling about getting into other areas, and they got Jack Goldman (who was a scientist at Ford) in as the new chief scientist at Xerox, and Jack realised that what we actually had to have at Xerox was something like a long-range research centre.

Xerox was a big company even then, and what Jack decided to do was to hire George Pake, who was the chancellor of Washington University in St. Louis, and George was quite familiar with ARPA, and so he hired Bob Taylor, who had been one of the funders at ARPA in the 1960s. Taylor was one of the champions of Engelbart, one of the champions of the work at RAND, and so forth. What Taylor did, instead of hiring random good scientists, was to set up a miniature, concentrated ARPA community. So he went out after all of the people he thought were the best in the ARPA research community, and those people were somewhat interested because the Mansfield Amendment was shutting down, constricting the kinds of far-out things that ARPA could do. So it was one of those things where the…

Int: What was the Mansfield Amendment?

AK: The Mansfield Amendment was an overreaction to the Vietnam War. A lot of people were worried about various kinds of government and especially military funding on campus, and secret funding, and all of this stuff. And the ARPA Information Processing Techniques projects got swept up in this, even though every single one of them was public domain and had already been spun off into dozens of companies, including DEC. And so this is just one of these broad-brush things done by Congress that happened to kill off one of the best things that’s ever been set up in this country. And so a lot of smart people in universities who believed more or less the same dream about the destiny of computers were gathered up by Taylor, and I was one of those.

Int: So how good were they — you were talking about the people that were gathered, how good were they?

AK: Well, I thought they were the best. […] I thought the people that were there were among the very best. By 1976, which was about six years after the place was started, there were 58 out of my personal pick of the top 100 in the world. And divided into a couple of labs. People who had very strong opinions, who normally would not work with each other at all in any normal situation, were welded together by having known each other in the past, having come up through the ARPA community believing more or less the same things. Of course, when you have that kind of level of people, the disagreements in the tenth decimal place become significant. But it worked out very well, because we were able to get together completely on joint hardware projects, so we built all of our own hardware as well as all of our own software there. And the hardware was shared amongst the many different kinds of groups that were there, so there was one set of hardware, as we moved on from year to year, and then many different kinds of software design.

Int: As Xerox had this idea of architecture of information, a rather general concept, what did you guys think you were doing?

AK: Well, the psychology of that kind of person would be thought of as arrogant in most situations. We thought we knew… We didn’t know exactly what we were going to do, but we were quite sure that nobody else could do it better, whatever it was. And the architecture of information, that was a phrase that sounded good; it didn’t actually mean anything. And there was some mumbling also about “If paper goes away in the office in the next 25 years, Xerox should be the company to do it”. Because that was their heartland business, but Taylor was setting the place up to realise the ARPA dream. Which was to do man-computer symbiosis in all of the different directions that ARPA had been interested in in the past, which included interactive computing, which included artificial intelligence and the notion of agents.

So there were projects that ranged from intensely practical ones, like Gary Starkweather doing the first laser printer; Metcalfe and other people doing the Ethernet, the first packet-switching Local Area Net; Thacker being the main designer of the Alto, which was the first workstation and the first computer to look like the Macintosh. I was head of the group that was interested in the Dynabook design, and a lot of the things that we came up with for the Dynabook were simulated on the Alto and became part of the first notion of what workstations and personal computers should be like. And those included things like the icons and the multiple overlapping windows and the whole paraphernalia that you see on machines like the Macintosh today.

Int: The basic idea was– the sort of machine you were interested in was one which had the capacity to handle rich graphical interfaces, but could be networked together. So it had to have more standalone power than normal time-sharing, is that true?

AK: Yeah. In fact, one of the things we were incredibly interested in was how powerful did it have to be. We used phrases like “We didn’t know how powerful it had to be, but it had to be able to handle something like 90–95% of the things that you would generally like to do before you wanted to go to something more–” of course, we didn’t know what that meant. We thought it would be somewhere around ten MIPS (million instructions per second), that was sort of the rule of thumb. The machine we built was effectively somewhere between one and three MIPS, depending on how you measured it. And so it was less powerful than we thought. On the other hand, it was about 50 times as powerful as a time-sharing terminal. So the magnification of what we could aspire to was enormous. And the machine was not done blindly. It was done after about a year of studying things like fonts, and what happens when the eye looks at little dots on screens, and what animation means, and we did a bunch of real-time music, very early synthesis kind of stuff, and so we knew a lot about what the machine should be and how fast it should run.

Int: So, to design this machine and the software that went with it, you studied human beings?

AK: Yeah. A lot of us did. What was nice here is, I was one of the drivers on one of these small machines. There were other groups at this time in 1971, when PARC was forming, who were building a large time-sharing system to act as a kind of an institutional resource for PARC. There were lots of different things going on. I wanted to do stuff with children because my feeling after doing the FLEX machine was that we, and I especially, were not very good at designing systems for adults. And so then I thought, “Well, children have to use this thing: why don’t we forget about adults for a while? We’ll just design a tool for children, a medium for children, and we’ll see what happens”. So I desperately wanted to get a bunch of these systems. I needed like fifteen of them. I needed to have them in a school somehow, with children, because I wanted to see how children reacted to these things, and improve the design that way.

Int: So the idea was that when you used children, it threw into focus many of the problems you had before?

AK: Yeah. I don’t know that it threw into… The shock when you go out to real users of any kind is enormous. Because technical people live in this tiny little world, actually. We like to think it’s a big world, but it’s actually a tiny little world. And it’s full of phrases that we learned when we were taking math classes, and it’s hermetic. And it’s full of people who like to learn complicated things; they delight in it. And so one of the hardest things for these kinds of people is to do any kind of user interface for people who are not like that at all. That’s the number one dictum of user interface design: the users are not like us.

And so what you need to have is some way of constantly shocking yourself into realising that the users are not like us. And children do it really well, because they don’t care about the same kinds of things that adults do, you know; they’re not like this, they’re not like that, they can always go out and play ball… They haven’t learned to feel guilty about not working yet. And the last thing you want to do is subject the children to more school; you don’t want to put them through a training course to learn a computing system. So we used the term forcing function. It was a setup where, in many ways, it’s harder to design a system for children than it is for adults, along a couple of avenues. And it forced us to start thinking about how human mentalities might work.

Int: These were your Smalltalk kids, right?

AK: These eventually became the Smalltalk kids. And the lead-in to that was that Seymour Papert had gotten a lot of good ideas from having worked with Piaget. And Piaget had a notion of the evolution of mentality in early childhood as being sort of like a caterpillar into a butterfly. And the most important idea was that at each stage of development the kid was not a deficient adult; it was a fully functioning organism that just didn’t think about the world quite the same way as adults did. And other psychologists like Jerome Bruner had similar ideas, also inspired by Piaget.

Int: What were these basic stages?

AK: Well, if you break them down into just three stages, there’s one in which a lot of thinking is done just by grabbing. A hole is to dig it, an object is to grab it. And then the next stage is very visual, in which many of the judgments that you make about the world are dominated by the way things look, so in this stage you have the Piaget water-pouring experiment: you pour from a squat glass into a tall thin one, and the kid in this stage says, “There’s more water in the tall thin one”. He doesn’t have an underlying notion, a symbolic notion, that quantities are conserved. He doesn’t have what Piaget calls conservation. And conservation is not a visual idea. Visual is what does it look like when you compare it. And then later on you get a stage that Piaget called the symbolic stage, where facts and logic start to dominate the ways of thinking, and a child in that stage knows as a fact that water doesn’t appear and disappear. Nor does anything else. And so he’s able to give you judgments or inferences based on these facts rather than on the way things appear.

Now, Bruner did a lot of these experiments over again and added some side trips into them. So one of the things he did in the water-pouring experiment was to interpose a piece of cardboard after the child had said “There’s more water in the tall thin glass”, and the kid immediately changed what he had just said; he said, “Oh no, there can’t be more, because where would it come from?” And Bruner took the card away, and the kid changed back; he said, “No, look, there is more.” Bruner put the card back and the kid said, “No, no, there can’t be more, where would it come from?” And so if you have any six-year-olds you’d like to torment, this is a pretty good way of doing it! And the way Bruner interpreted this is that we actually have separate ways of knowing about the world. One of them is kinesthetic, one of them is visual, one of them is symbolic, and of course we have other ways as well. And these ways are in parallel. They really constitute separate mentalities. And because of their evolutionary history, they are rather distinct in the way that they deal with the world, and they are not terribly well integrated with each other. So you can get some remarkable results by telling people an English sentence, which they’ll accept, and then showing them a picture that represents the same thing, and they’ll have an enormous emotional reaction.

Int: How was this going to help your interface design?

AK: Well, you know, we were clutching at straws! You’ll do anything when you don’t know what to do. But since we were going to be dealing with humans, it seemed like a reasonable thing to try and get something out of the newly-formed cognitive psychology. The Piaget stuff didn’t help me much, because one of the ways of interpreting it is that, for example, you probably shouldn’t teach first graders arithmetic, because they aren’t quite set up to receive it yet. You’d be better off teaching them something that’s more kinesthetic, like geometry or topology. They’re learning, they’re tying their shoes and stuff, and there are a whole bunch of things you can do with knots and things that are higher mathematics. But the particular symbolic manipulations of arithmetic are not easy for them at that stage. Well, that’s important when you’re designing curriculum in schools, but it didn’t tell us much about a user interface.

The Bruner thing, though, was really exciting. Because one of the ways of interpreting it is that at every age in a person’s life there are these multiple mentalities there. Now, when your kid is six years old, the symbolic one may not be functioning very much, but there are tricks you can use to get it to function. And Bruner had this notion that you could take an idea that’s normally only taught symbolically, like some physical principle, and teach it to a six-year-old kinesthetically, and to a ten-year-old visually. And later on, to a fifteen-year-old, you can teach it symbolically. This is the notion of a spiral.

And Papert had the same idea. If you want to learn how a gyroscope works, the last thing you want to do is look at the equation. What you want to do is do a little bit of visual reasoning about the thing, and then have one in your hand in the form of a big bicycle wheel, and when you turn that bicycle wheel and feel what it does, it clicks together with a small amount of reasoning that you’re doing, and all of a sudden you understand why it has to flop the other way. That there’s nothing that mysterious about it. But from the outside and looking at the equation, and thinking about cross products and stuff like that, it just doesn’t make any sense.

So Bruner’s idea was, when you don’t know what else to do in developing a curriculum, you try recapitulating the Piagetian stages: cast this adult-level knowledge down into a kinesthetic form, teach it over and over again, and recast it later on in a visual form. And he thought of these ‘lower’ mentalities, the kinesthetic and iconic mentalities, as being what we call intuition. They can’t talk much, but they have their own figurative ways of knowing about the world that supply a nice warm way of thinking about physical problems.

Int: Engelbart had some of these already. The mouse is a kinesthetic–

AK: Yes, but not a good one. It’s okay. But if you only use it to point on the screen, it’s not very kinesthetic. Because the major operation in kinesthesia is grabbing something and moving it. And Engelbart did not have that. Now, of course, people have been moving things around in computer graphics, and most people that had ever done this liked it better. The first little paper I wrote about the Dynabook talked about moving things around, and GRAIL moved things around. Sketchpad moved things around. It’s a natural thing when you’re thinking about things from a graphic centre, which Engelbart wasn’t: even though he had a little bit of graphics, he was thinking about it from text. When you think about things from a graphic centre, it’s natural to want to just move something around. And so the connection between what had already been done and what Bruner was saying got us thinking a little bit more strongly about what the mouse was supposed to be about. The mouse was not just for indicating things on the screen; it was to give you a kinesthetic entry into this world.

Now, the important thing about it is that — I don’t know how all scientific discoveries are made. They usually are connected together with little flashes of insight over long periods of time. But in fact, when I started off I was quite sure that Bruner had the key, and we got the ingredients rather early for doing the user interface. But it was remarkable to me how many people we had to try ideas out on for years, for three or four years… We had to not only build the system, but in fact the early versions of the system were not as good as the later versions, and it actually took about five years and dealing with hundreds of people to come out with the first thing that sort of looked like the Mac.

Int: And this is like the desktop…

AK: Yeah, the desktop was actually an idea we didn’t like particularly. That was something that was developed when Xerox was thinking specifically about going after offices.

Int: This was the icons, windows…

AK: Yeah, in the Dynabook idea, when you have a Dynabook and you have simulations and multidimensional things as your major way of storing knowledge, the last thing you want to do is print any of it out. Because you are destroying the multidimensionality and you can’t run them. So one of the rules is, if you have anything interesting in your computer, you don’t want to print it. And that’s why if your computer is always with you, you don’t want to [print it], you won’t print it. When you give it to somebody else, what you’re sending to them is something that was in your Dynabook to their Dynabook.

And the user interface that we worked on was very much aimed at that, and so it had a different notion than the desktop. The desktop was sort of a smaller version of it. What we had was this, you can think of it as multiple desks if you want, but we had this notion that people wanted to work on projects. And I especially liked to work on multiple projects, and people used to accuse me of abandoning my desk. You know, it got piled up, and I’d go to another one and pile it up. And when I’m working on multiple things I like to have a table full of all the crap that goes there, and another table, and so forth, so that everything is in the state it was when I last left it. And that was the user interface that we wound up doing, where you could move from one project to another project, and if you did it on the Macintosh, you could think of it as having multiple Macintosh desks, each one of which was all the tools for that particular project.

Int: […] You used the term intuitive before, or, we feel [that] things that use our kinesthetic and iconic mentalities are intuitive. Also the term leverage is used. Can you tell me what you mean when you confer leverage on a user?

AK: Yeah. Another part of my background, before I was heavily into computer science, was theatre. And if you think about it, theatre is kind of an absurd setup, because you have perhaps a balding person in his forties impersonating a teenager in triumph, holding a skull in front of cardboard scenery, right? It can’t possibly work. But in fact it does, all the time. And the reason it does is that theatre, the theatrical experience, is a mirror that blasts the audience’s intelligence back out at them. So it’s as somebody once said: “People don’t come to the theatre to forget, they come tingling to remember.” And so what [theatre] is, is an evoker. And again, having the thing go from an automobile to a medium started me thinking about the user interface as a theatrical notion. And what you do in the theatre is take something that is impossibly complex, like human existence or something, and present it in something that is more like stereotypes, which the user can then re-synthesise human existence out of. I think it was Picasso who said that art is not the truth, art is a lie that tells the truth. And so I started thinking about the user interface as something that was a lie, namely, that it really isn’t what’s going on inside the computer, but it nonetheless tells the truth because it evokes — it’s a different kind of computer. It evokes what the computer can do in the user’s mind, and that’s what you’re trying to do. So that’s the computer they wind up with. And that’s what I think of as leverage.

Int: So just as with time-sharing, you sort of had the illusion that you had exclusive use of the computer. Here, you have a different illusion that you’re moving things on, that there are objects on your screen.

AK: Yes. In fact, we use the term user illusion. I don’t think we ever used the term user interface, or at least certainly not in the early days. And we did not use the term metaphor, as in desktop metaphor. That was something that I think came along later. The idea was, what can you do theatrically that will cover the amount of magic that you want the person to be able to experience? Because, again, the understanding was in the theatre you don’t have to present the world as physicists see it, but if you’re going to have magic in Act III, you better foreshadow it in Act I, so there’s this whole notion of how do you dress the set in order to set up expectations that will later turn out to be true in the audience’s mind?

Int: Now while you were working on these things, other people at PARC were building a machine?

AK: They were building various machines. And it’s kind of funny, because the way it worked out is, I took Wes Clark’s LINC design and came up with a sort of modern version of that in a little handheld— you know, sort of like a suitcase, that I called miniCOM, and had done some design of it, how things would run on it… And the problem was it wasn’t powerful enough. But it was something I figured I could get 15 or 20 of them for the experiments that we wanted, and I had designed the first version of Smalltalk for this machine. And much to my amazement I got turned down flat by a nameless Xerox executive. And I was crushed. You know, I was only 31 years old then, and just — ach! — it was terrible. And so then I evolved another scheme of getting a bunch of little minicomputers, like Data General Novas, and I could only afford five of those (I had something like $230,000 or so), and I figured “Five of those, and [I’ll] take the system they were experimenting with fonts and bitmapped displays on and I’ll replicate that, I’ll get five of them…” It’s an incredibly expensive way to do it, but that would give me five machines to do stuff.

And then this Xerox executive went away on a task force, and Chuck Thacker and Butler Lampson came over to me in September of 1972 and said to me, “Have you got any money?” And I said, “Well, I’ve got about $230,000.” And they said, “Well, how would you like us to build you your machine?” And I said, “I’d like it fine.” And so what I gave them was the results of the experiments I had done in fonts and bitmap graphics, plus we had done some prototyping of animations and music. So to do all these things, it had to be so-and-so fast. And what happened there was a combination of my interest in a very small machine and Butler Lampson’s interest in [having] a $500 PDP-10 someday. And Chuck Thacker wanted to build a machine ten times as fast as a Data General Nova. Chuck was the hardware genius, so he got to decide what it was in the end, and he went off and, in three and a half months with two technicians, completely built the first machine that had the bitmapped display. And he built it not as small as I wanted it, but ten times faster than I had planned, and it was a larger-capacity machine. So it was simultaneously both the first thing that was like a Mac, and the first workstation.

Int: This was the Alto.

AK: An absolute triumph. Yeah, this was the Alto.

Int: What was the bitmapped display? Why was that so…?

AK: Well, that was fairly controversial, because the problem is that bits were expensive back then. And before about 1971, bits were intolerably expensive, because in memory they were in the form of these little magnetic cores, and even a modest memory was many cubic feet, and required expensive sense amplifiers and power supplies — it was a mess. And in the early 1970s, Intel came out with a chip called the 1103 that had a thousand bits on it. The chips you buy today have like 4 million bits, and the Japanese are making ones with 16 million– 64 million bits. So this was a thousand bits on a chip. This is the best thing that ever happened. Because all of a sudden, everything changed. This is one of these predictions that Bob Noyce and Gordon Moore [made] that completely changed our relationship to computing hardware. And the people who wanted to build a big time-sharing machine at PARC, including Chuck Thacker and Butler Lampson, immediately started building one out of this memory. And it was also the right kind of thing to build the Alto out of.

But we still couldn’t have a lot of it. This was 1972 that we’re talking about. And so this notion… Bitmapped display is something that we didn’t invent. People had talked about it, there even were small versions of it. Television is something that is discrete in one dimension and analogue in another dimension, but you can think of it as being 300,000 separate pixels on a screen, and it’s something everybody would like, because it’s perfectly general. The nicest thing about television is that you don’t have to determine beforehand what kind of picture you’re going to show on it. You can show anything. You don’t have to worry. Whereas every other kind of display, you had to have some notion about what you were going to display in order to know how much power you should build into the display. So we spent a year and a half or so trying different kinds of displays and making calculations and stuff, and at some point we realised, “Oh, the heck with it, we just have to go for a bitmapped display”, and of course we didn’t know that people would settle for tiny little displays like this, you know, so the first display on the Alto actually was about two and a half times the size of the first Mac. It was about 500,000 pixels and it was an 8½×11” page display, the exact size that the Dynabook was supposed to be.
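As a rough illustration of the memory arithmetic behind that decision, here is a small back-of-the-envelope sketch in Python. It uses only the round figures quoted above (a roughly 500,000-pixel, one-bit-per-pixel display and the 1,000-bit Intel 1103 chip); the actual Alto numbers differed somewhat.

```python
# Back-of-the-envelope memory arithmetic for a 1972-style bitmapped display,
# using only the round figures quoted in the interview.

PIXELS = 500_000            # roughly an 8.5 x 11 inch page display
BITS_PER_1103_CHIP = 1_000  # the Intel 1103 held "a thousand bits"

framebuffer_bits = PIXELS * 1                        # monochrome: one bit per pixel
chips_for_screen = framebuffer_bits / BITS_PER_1103_CHIP

print(f"Framebuffer size: {framebuffer_bits:,} bits")
print(f"1103 chips needed just to hold the picture: {chips_for_screen:,.0f}")
# About 500 chips of then brand-new DRAM devoted to nothing but the screen image,
# which is why "we still couldn't have a lot of it" in 1972.
```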

Int: Is it true you programmed the Cookie Monster? Is that true?

AK: The Cookie Monster… I invented the first painting program in the summer of 1972 and it was programmed up by Steve Purcell, who was one of our students. And again the idea was to get away from having just crisp computer graphics, as they had been in the past, because the idea was that people don’t design on computers, and one of the reasons they don’t design on computers is they can’t make things messy enough. And my friend Nicholas Negroponte at MIT was also thinking similar thoughts. He wanted people to be able to sketch as well as to render. And so I invented the painting program. The painting program was a direct result of thinking seriously about bitmapped displays. It’s maybe the first thing you think of, because you are always painting on it. When you’re putting up characters, you’re painting characters. And it was very much like how MacPaint turned out to be many years later. I have a nice video showing it being used. And the second thing I thought of was that having a bitmapped display, combined with multiple windows like the FLEX machine had, would automatically allow us to do this overlapping idea, which we all thought was terribly important, because we thought this 8½×11” display was impossibly small for all the stuff we wanted to put on it, and the overlapping windows would give you a way of having much more effective screen space as you move things around.
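To make the “you are always painting on it” idea concrete, here is a toy sketch in Python of the essential mechanism of a painting program on a bitmapped display: the screen is an array of bits, and the brush simply sets the bits underneath it. This is only an illustration, not the 1972 program Kay describes; the framebuffer size, brush shape, and function names are all invented for the example.

```python
# Toy sketch: painting on a bitmapped display means setting the bits
# under a brush wherever the pointer goes. (All names and sizes invented.)

WIDTH, HEIGHT = 64, 16
bitmap = [[0] * WIDTH for _ in range(HEIGHT)]   # 1-bit-per-pixel framebuffer

def stamp_brush(x, y, radius=2):
    """Set every pixel within `radius` of (x, y), clipped to the screen edges."""
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            px, py = x + dx, y + dy
            if 0 <= px < WIDTH and 0 <= py < HEIGHT:
                bitmap[py][px] = 1

# Pretend these are successive mouse positions while the button is held down.
for pos in [(5, 4), (10, 6), (16, 8), (23, 9), (31, 10)]:
    stamp_brush(*pos)

for row in bitmap:
    print("".join("#" if bit else "." for bit in row))
```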

Int: Now this machine, the Alto, the prototype was produced in 1973. How does it compare, in terms of what it’s got, with the Macintoshes and the machines we have now?

AK: Well, the first thing is, the Macintosh represents an amount of work — even to get the first Mac out — equivalent to what we did at Xerox, added on to it. So the screen design in the Macintosh is unique to the Macintosh and much nicer than what we used. I mean, there are all of these extra things on it. And of course the number of tools you can buy — there are something like 9,000 pieces of software rendered in the Macintosh user interface that you can buy for it. So the scope is enormous. But in terms of the categories it’s about the same, you know. We had both painting with bitmaps and crisp computer graphics, we did the first desktop publishing, the first WYSIWYG editors; and most of the paraphernalia that people work with — with one exception — was part of the original concept of the software that went on the Alto. And that exception was spreadsheets.

Int: So one of the things that is quite interesting for an outsider [is that] they see this: [in the] early, mid-1970s, most of the key ideas of what we now regard as sort of intimate computing are there. Yet Xerox is not a computer company. Did you have hope? What were your hopes when the Alto was built? Did you hope that the world would see this product?

AK: I for one oscillated between… because I thought of the Alto as just an intermediate stage. I wanted the Dynabook. And I thought of that user interface as a stopgap measure for the real thing… See, I was basically interested not in the access literacy part, which the user interface represents, but in– [There’s a technical interruption here.]

Int: Did you hope the world would get to see this? Because this was no longer a general research program…

AK: Well, one of the things that we realised early in the game with Xerox and its executives was that we were going to have to do a lot of demonstrations. Because we originally started out trying to talk to them and we wrote a bunch of papers called the Pendery Papers that expressed what we were trying to do, and those did not get received into any worthwhile corners of their minds. So we thought that we would have to do demonstrations. So we were set up to do demonstrations, we did lots of demonstrations. Also we didn’t want to be cut off from the research community that we had come from. So we wanted to be able to give demos. And the experience with Xerox went from the lab being fairly open, to periods of paranoia, where the lab was completely closed to visitors. And we used to say to Xerox, “Look, we’re years ahead of everybody else, you know. Lead time is everything in this business. And you just carry through on this stuff. Don’t worry, you can show everybody in the world this stuff. It doesn’t matter — they won’t be able to build it in time”.

Int: Were they–

AK: It’s still true. I mean, if you look at the Macintosh, which came out years later, look how long it’s taken IBM to not quite get there yet. They’re still pushing that idea — Windows 3.0 has just come out. This stuff takes years to do. If you have a few years’ lead time on anybody and your people are capable, you can sustain it.

Int: So Xerox could’ve had it all? There might not be an Apple.

AK: Well, I don’t know about that, because it’s really hard to predict that kind of future. I was amazed at the number of people, for instance, who bought IBM PCs, considering how useless I thought they were. And I think the interest of everybody in switching over to the overlapping window interface with icons and mouse shows that what people bought millions of is not what they want. They want what we wanted early in the 1970s. But I think predicting what people will do under conditions of expediency is impossible.

Int: In those days there were just a few Altos; you couldn’t buy them, right? There weren’t that many, and Xerox didn’t seem to have any immediate plans to sell them.

AK: Yeah. Well, we actually set up a production line for the things because, as I said, I needed quite a few. And they started getting popular, and so we set up a production line and I believe that upwards of 2,000 of them were built in the decade of the 1970s. Certainly by 1975 or ’76, almost everybody at PARC including the secretaries had Altos, and they were connected together by Ethernet, and there were page-a-second laser printers nearby that you could go to, so it was a very similar model to the offices that you see today. When Xerox made its fateful turndown in 1976 there were at least 500 machines operating the way machines do in the 1980s. So they were not turning down a paper idea or a single prototype: they were turning down something that had been functioning for three years.

Int: And this was turning down an idea to sell a computer with a word processor as opposed to a…

AK: Well, the story is complicated. It has to do with a bunch of things. I think the simplest way of characterising what happened is that — just as when IBM in the 1950s turned down their chance at the Xerox machine, because they ultimately thought they were in the computer business rather than the communications business — Xerox, when push came to shove, thought they were a copier company. It’s the problem with railroads: they tried to make faster locomotives rather than investing in the airlines. So there was that problem. There was a problem of slightly more cost in doing the Alto versus the word processor that they wanted to do. All these things contributed.

I think one of the main problems is that the executive who made the final decision had no way of personally evaluating what he was being told from different sides. So it was complicated. I mean, [in] 1976, when they had to make the decision, you figure [that] under the best conditions it would take a couple of years to get a machine out. They would’ve had to charge around $20,000 or so for it, which is not bad that early, because that’s what word processors were selling for, but it was right on the edge. And by the time the Star came out, which was too little too late, it was I guess a $20,000 machine that should’ve been sold for around $10,000. The [Apple] Lisa had some of the same problem: it was a $10,000 machine that should’ve sold for a couple of thousand.
