ChatGPT creators try to use artificial intelligence to explain itself – and come across major problems
ChatGPT’s creators have attempted to get the system to explain itself. They found that while they had some success, they ran into some issues – including the fact that artificial intelligence may be using concepts that humans do not have named for, or understanding of. Researchers at OpenAI, which developed ChatGPT, used the most recent version of its model known as GPT-4 to try and explain the behaviour of GPT-2, an earlier version. It is an attempt to overcome the so-called black box problem with large language models such as GPT. While we have a relatively good understanding of what goes into and comes out of such systems, the actual work that goes on inside remains largely mysterious. That is not only a problem because it makes things difficult for researchers. It also means that there is little way of knowing what biases might be involved in the system, or if it is providing false information to people using it, since there is no way of knowing how it came to the conclusions it did. Engineers and scientists have aimed to resolve this problem with “interpretability research”, which seeks find ways to look inside the model itself and better understand what is going on. That has often required looking at the “neutrons” that make up such a model: just like in the human brain, an AI system is made up of a host of so-called neutrons that represent parts of the data it uses. Finding those is difficult, however, since humans have had to pick through the neurons and manually inspect them to find out what they represent. But some systems have hundreds of billions of parameters and so actually getting through them all with people is impossible. Now, researchers at OpenAI have looked to use GPT-4 to automate that process, in an attempt to more quickly pick through the behaviour. They did so by attempting to create an automated process that would allow the system to provide natural language explanations of the neuron’s behaviour – and apply that to another, earlier language model. That worked in three steps: looking at the neuron in GPT-2 and having GPT-4 try and explain it, then simulating what that neuron would, and finally scoring that explanation by comparing how the simulated activation worked with the real one. Most of those explanations went badly, and GPT-4 scored itself poorly. But researchers said that they hoped the experiment showed that it would be possible to use the AI technology to explain itself, with further work. The creators came up against a range of “limitations”, however, that mean the system as it exists now is not as good as humans at explaining the behaviour. Part of the problem may be that explaining how the system is working in normal language is impossible – because the system may be using individual concepts that humans cannot name. “We focused on short natural language explanations, but neurons may have very complex behavior that is impossible to describe succinctly,” the authors write. “For example, neurons could be highly polysemantic (representing many distinct concepts) or could represent single concepts that humans don’t understand or have words for.” It also runs into problems because it is focused on specifically what each neuron does individually, and not how that might affect things later on in the text. Similarly, it can explain specific behaviour but not what mechanism is producing that behaviour, and so might spot The system also uses a lot of computing power, the researchers note. Read More Google to unveil major new AI AI robots figure out how to play football in shambolic footage White House asks hackers to break ChatGPT White House reveals plan to ‘protect’ citizens from danger of AI DeepMind boss says human-level AI is just a few years away Regulator to probe use of artificial intelligence such as ChatGPT
2023-05-10 22:49
Toshiba Releases 3rd Generation SiC MOSFETs for Industrial Equipment with Four-Pin Package that Reduces Switching Loss
KAWASAKI, Japan--(BUSINESS WIRE)--Aug 30, 2023--
2023-08-31 10:21
Applied Materials forecasts first-quarter revenue ahead of estimates
(Reuters) -Semiconductor equipment maker Applied Materials on Thursday forecast first-quarter revenue above Wall Street estimates, helped by a recovery in
2023-11-17 05:28
Lenovo Reportedly Working on Its Own Steam Deck Rival
It looks like Lenovo is preparing its own competitor to Valve's Steam Deck. The company
2023-08-02 00:49
BOJ Stance to Put Japan Earnings Outlook Under Scrutiny Next Week
The Bank of Japan is set to conclude its two-day policy board meeting today. The yen climbed against
2023-07-28 11:17
8 tips for parents and teens on social media use — from the U.S. surgeon general
Dr. Vivek Murthy, the U.S. surgeon general, is calling for “immediate action” by tech companies and lawmakers to protect kids’ and adolescents’ mental health on social media
2023-05-23 19:18
California Says Electric Cars Now Make Up a Fifth of Auto Sales
One out of every five cars sold in California is now powered by a battery, registration data released
2023-11-02 04:59
Orsted Funding Gap Puts Credit Rating at Risk, Jefferies Says
Danish wind developer Orsted A/S is facing a steep balance sheet gap even after abandoning some of its
2023-11-16 16:52
Cisco cuts annual forecasts on slowdown in new orders
(Reuters) -Cisco Systems cut its full-year revenue and profit forecasts on Wednesday in a sign that demand for its networking
2023-11-16 05:48
Save $45 on an Amazon Fire HD tablet at Best Buy until August 5
Save $45: As of August 4, the Amazon Fire Tablet HD (32GB, 8-inch) is on
2023-08-05 04:25
What Time is Starfield Playable?
Here's when gamers can start playing the year's most anticipated game: Starfield.
2023-09-01 04:28
Congress Wades Into Self-Driving Debate With New US House Bills
Congress is wading into the debate over whether automakers should be allowed to sell hundreds of thousands of
2023-07-26 03:53
You Might Like...
Dell's revenue forecast signals AI boost will take longer to materialize
Winklevoss’s Gemini Crypto Exchange Sues DCG, CEO Barry Silbert
Apple's new iPad Pro will have an OLED display and new keyboard, report says
Keyboard maker Logitech raises forecast for first half of 2024
Fed warned Goldman's fintech unit on risk, compliance oversight -FT
X adds "Formerly Twitter" to App Store listing as app plunges in the charts
Audio book narrators say AI is already taking away business
What Map is Sunset Replacing in Valorant?
