Turn Down the Heat
Please
By Ed Sperling -- Electronic News, 7/7/2006
Tom Reeves, VP of semiconductor and technology services at IBM, sat down with Electronic News at the site of the companys 200mm fab and mask operation near Essex Junction, Vt., for a candid conversation about whats next in chip manufacturing, where the problems are and where future technology will come from. What follows are excerpts of that conversation.
Electronic News: Whats the next big break in chip technology?
Reeves: Through the 70s and early 80s, bipolars went up to 100 watts. We had water-cooling systems, but you needed something new. Then we started with CMOS, which was a Holy Grail step-function improvement. Now, 20 years later, weve got 100 to 120 watt chips again. Power is everything. The efforts were taking to get leakage power down for cell phones or a base station or a Cisco switch are enormous. If you look at a chip in a base station or a switch, theyre 40 watts, and there are a lot of them. The total wattage gets up to 5,000 or 10,000. So the major focus now is not on Moores Law and how you get the next density step. Well get that. How you get the next performance step is harder work than its been, too. But the most important issue is how you manage power. Leakage power at the most advanced lithography is very challenging. And with active power, can you cool the gain? College kids were hanging some gaming systems out their dorm windows to cool them down.
Electronic News: So how do we solve these problems? Are we at a point where the road map is broken and we have to re-think what were doing?
Reeves: I think we can make incremental improvements. At 65 nanometers, we have solved that. At 45, we have additional ideas. But as you look at 32 nanometers and beyond, its still an open question. At 65 and 45 were using header and footer switches to turn things on and off, dynamic voltage scaling, dynamic frequency scaling. Were qualifying processes for lower voltage levels than we would have. Weve used voltage islands, too.
Electronic News: But isnt that a Band-Aid approach?
Reeves: Id call it business as usualbrute, tough-it-out engineering. Its not like bipolar going to CMOS, though. We dont have a panacea on our road map. There are certainly some interesting technologiescarbon nanotubes are one of thembut they all look as if theyre 20 years out, not five years out. There are some ultimate solutions that are game-changers. But at 32 nanometers and the step after that, we have CMOS and SOI.
Electronic News: Whats the big bottleneck now?
Reeves: Leakage power is now equal to active power. Leakage power used to be insignificant. The Swiss and Japanese watchmakers were the only ones worried about leakage because it affected how long the battery lasted. Sharp created a new cell phone, which is only available in Japan, with an Aquos LCD screen for watching TV. Its the same material as in their TV screens. It tunes analog signals, digital signals and FM signals through an IBM silicon-germanium tuner, and it has an IBM EDRAM ASIC. They came to us and said they wanted 600,000 of each as fast as we could make them. It launched in early June in Japan. TV-tuning standards are different by country. A different model has to come out for Korea and Europe, and a different oneprobably years from nowin the United States, because we tend to lag these standard migrations.
Electronic News: Does it matter to IBM which form factors sell best and which standards are used?
Reeves: We buffer the risk for customers. For example, were not sure whether ultra-wideband, WiMax or Zigbee will win, but we have collaborative design partners on all of them. Whatever wins, were going to ride it.
Electronic News: Are there any trends about whos going to be using these new technologies?
Reeves: Well, the United States is embarrassingly delayed on cell phones technology. I was in Japan about a month ago and commenting about when a phone in the United States would be able to get these terrestrial TV broadcasts. Every one of our salespeople there opened up their cell phones. They all had TV tuners. The oldest phone was two years old. In Japan, theyve had broadcast-reception phones for two years. Korea and Japan drive these new standards aggressively. Only GPS [global positions systems] in phones seems to be rolling out as fast here. This is a new market for IBM. Weve only recently gotten into the consumer market, and its a very sophisticated market.
Electronic News: Will consumer drive the high-end of the chip market or will it still be computers and networking?
Reeves: The consumer market is going to ship more 65 nanometer earlier than the data-processing or the networking market. The network market drives 18-by-18 die in every generation, which the consumer market never will. Data processing and networking will always lead in difficulty at the mask house, yield engineering and test strategy. But in terms of using new litho nodes, the digital camera guys are further along in their plans of ramping manufacturing than the Ciscos and Junipers.
Electronic News: Lets swap directions here. A year ago you said that if IBMs customers follow its design rules, yield will be in the 90 percent range. Is that still true?
Reeves: Weve maintained that 90 percent to 100 percent range consistently for digital ASICs. We provide the entire design environment, including a test-generation methodology. Cisco will do six ASICs for a line card, and all six will be single-pass silicon. Whats new is that weve extended that approach into the world of analog. In that case we wont provide an entire ASIC design flow. We provide electrical models. But what were demonstrating is that we have a much tighter accuracy between the electrical models to the silicon we get back than other mixed-signal suppliers. Analog is something of a black art. But if you know what you want and how to design it, and assuming the electrical models are right, IBM can give you an environment where you have first-pass analog silicon. Other vendors are not that close in terms of electrical-to-hardware coordination. If it doesnt work, you have to determine whether it was the design or the electrical models. Its a very difficult process.
Electronic News: As a result of this, are you finding more buy-in from customers than in the past for your recommendations?
Reeves: I think theres a clear trend toward buy-in. More and more, people are looking for tools to help them analyze the complexity of their potential designs before they send it out, and if it yields can they drop their customer price? Those conversations didnt occur five years ago.
Electronic News: Does that 90 percent number work for analog designs, too?
Reeves: No, thats strictly for digital ASICs, where you have an IBM-managed library, an IBM timing tool and router, IBM test methodology and power management. In analog and mixed signal, weve taken the uncertainty out of whether silicon will behave exactly the same way as the electrical models we gave you. But in that area, the client is still picking what tool they want to use. It may be Cadence for one thing and Synopsys for another. We havent made an investment in an RF CMOS or a mixed-signal design system.
Electronic News: Lets look at design for manufacturability from a different standpoint. IBM has said it needs seven of the eight cores on the Cell processor to work for Sonys Playstation. Will there be an aftermarket for chips with fewer operational cores?
Reeves: There are a lot of chips with six cores operational, and weve been thinking about whether we should really throw all of those away. We also have a separate part number for chips with all eight cores good. The stuff thats going to be for medical imaging, aerospace and defense and data uses eight cores.
Electronic News: But might it be the less-expensive version of Playstation 3?
Reeves: It could, but I dont think Sony has thought about offering that. That doesnt mean there arent good uses for a chip with four SPEs [synergistic processing elements].
Electronic News: Whats the defining factor that makes some chips better than others?
Reeves: Defects. It becomes a bigger problem the bigger the chip is. With chips that are one-by-one and silicon germanium, we can get yields of 95 percent. With a chip like the Cell processor, youre lucky to get 10 or 20 percent. If you put logic redundancy on it, you can double that. Its a great strategy, and Im not sure anyone other than IBM is doing that with logic. Everybody does it with DRAM. There are always extra bits in there for memory. People have not yet moved to logic block redundancy, though.
Electronic News: Do any of those cores ever go bad, so that you start out with seven and you wind up with six or five?
Reeves: Theres a reliability failure rate for all chip types. By definition, reliability failure is one point circuit that has failed. If it happens to be in an SPE, it will knock out one of the cores. We have electronic fuses now, rather than laser fuses, which you can only blow when youre doing wafer tests. Electronic fuses you blow electrically. If you really want to be focused on reliability and up-time availability, you can design one of these chips to self-detect. You can ship it with eight cores working, blow one of them, and from a user perspective you would have self-healed it in the field.
Electronic News: But would it be as fast as the chip with eight cores?
Reeves: Yes, because the Playstation 3 only uses seven of them. Youd have a spare. That isnt implemented in Cell, but it could be. We implemented that same strategy for IBM systems. If you take a logic hit on a chip, you dont have any impact on performance because there is enough redundancy built in.
Electronic News: What happens if one of the cores blows on the Sony Playstation 3 if there are only seven to start with?
Reeves: Its just like a reliability failure on your TV or DVD recorder. If its within warranty, you send it back. If its not, your game doesnt work anymore. Youll always have choices about how reliable you want to make a chip with burn-in. Most chips that go into the consumer marketplace on things such as camcorders or DVD players arent burned in. But you can add burn-in and improve reliability 5x to 10x. Its extra cost. Certainly, a company like Sony adds that in.
Electronic News: How much extra cost?
Reeves: Its variable. On DRAMs and SRAMs, its cents. On processors, because theyre so high-powered, its not trivial to power 100 or 1,000 at a time. With all the wattage, it can be dollars.
Electronic News: With the price Sony is going to charge, it can easily add that into the cost.
Reeves: Sony is very concerned about quality and backward compatibility. They want to get this right. They tested game after game after game. When there were about 40 Playstation 1 games that didnt work properly, that didnt pass their criteria for quality.
Electronic News: So does that mean the current Playstation 2 systems have a Cell processor?
Reeves: No, they have a 440 Power processor. Its a 130-nanometer, single-core ASIC chip. Its the same technology as if you buy a Sony DVD or a Sony Bravia TV. Sony is replacing all the Mips design points with Power design points.