ChatGPT-maker OpenAI says it is doubling down on preventing AI from 'going rogue'

ChatGPT-maker OpenAI says it is doubling down on preventing AI from 'going rogue'

Sam Altman, CEO of Microsoft-backed OpenAI and ChatGPT creator listens to Ilya Sutskever, co-founder and Chief Scientist of OpenAI during a talk at Tel Aviv University in Tel Aviv, Israel on June 5.

PHOTO: Reuters

PUBLISHED ONJuly 06, 2023 1:59 AM

ChatGPT's creator OpenAI plans to invest significant resources and create a new research team that will seek to ensure its artificial intelligence remains safe for humans - eventually using AI to supervise itself, it said on Wednesday (July 5).

"The vast power of superintelligence could ... lead to the disempowerment of humanity or even human extinction," OpenAI co-founder Ilya Sutskever and head of alignment Jan Leike wrote in a blog post. "Currently, we don't have a solution for steering or controlling a potentially superintelligent AI, and preventing it from going rogue."

Superintelligent AI - systems more intelligent than humans - could arrive this decade, the blog post's authors predicted. Humans will need better techniques than currently available to be able to control the superintelligent AI, hence the need for breakthroughs in so-called "alignment research," which focuses on ensuring AI remains beneficial to humans, according to the authors.

OpenAI, backed by Microsoft, is dedicating 20 per cent of the compute power it has secured over the next four years to solving this problem, they wrote. In addition, the company is forming a new team that will organise around this effort, called the Superalignment team.

The team's goal is to create a "human-level" AI alignment researcher, and then scale it through vast amounts of compute power. OpenAI says that means they will train AI systems using human feedback, train AI systems to assistant human evaluation, and then finally train AI systems to actually do the alignment research.

AI safety advocate Connor Leahy said the plan was fundamentally flawed because the initial human-level AI could run amok and wreak havoc before it could be compelled to solve AI safety problems.

"You have to solve alignment before you build human-level intelligence, otherwise by default you won't control it," he said in an interview. "I personally do not think this is a particularly good or safe plan."

The potential dangers of AI have been top of mind for both AI researchers and the general public. In April, a group of AI industry leaders and experts signed an open letter calling for a six-month pause in developing systems more powerful than OpenAI's GPT-4, citing potential risks to society. A May Reuters/Ipsos poll found that more than two-thirds of Americans are concerned about the possible negative effects of AI and 61 per cent believe it could threaten civilisation.

ALSO READ: OpenAI rolls out 'incognito mode' on ChatGPT

Source: Reuters

Microsoft Artificial AI research Safety Digital

homepage

trending

trending

Tanjong Katong Road South repair works completed, to reopen in phases from Aug 2: LTA, PUB

Tanjong Katong Road South repair works completed, to reopen in phases from Aug 2: LTA, PUB

Trump hits dozens of countries with steep tariffs, including 35% for Canadian goods

PM Lawrence Wong to deliver National Day message on Aug 8

Malaysia tourism group says LTA crackdown on illegal cross-border ride services at Changi Airport 'inconveniences travellers'

Support local: FairPrice launches farmers market with Singapore-grown produce, includes exclusive plushies and more

$12.8m Toto jackpot won by single ticket bought online

28 arrested, luxury cars seized during anti-vice raids

Part-time PHV driver who stopped suicide attempt among 38 recipients of MHA’s public spiritedness award

Edwin Goh and Rachel Wan's wedding to be for next year: 'There's still a lot of things we need to figure out'

JB car wash operators say 'unfair' after business declines amid govt clampdown over prioritising Singapore-registered cars

A slice of America: Corvette makes its long-awaited debut in Singapore

'I'm happy taking the audience seat': Andrew Seow, now auxiliary police officer, reflects on past acting career

'Proud of what they've done': Jetstar Asia CEO expresses gratitude to crew on airline's final day of operations

Singapore

Singapore

Tan Kiat How 'heartened' as vape disposal bin in Bedok half-filled in just 4 days

ICA to issue no-boarding directives to prevent high risk travellers from entering Singapore

Nearly 27kg of cocaine found in stuffed toys at Changi Airport, 5 foreigners arrested

3-room and bigger Tampines, Toa Payoh BTO flats most popular with first-timers in July HDB launch

'On the verge of losing $10k': Vendors voice concerns about poor business at Bayfront SG60 food fair

'It was so gross': Man left disgusted after finding maggots in meal at Hougang restaurant

Safeguards in place to deter fraudulent injury claims at workplace: MOM

Man accused of raping woman who hired him to fix lights in her flat claims she made first move

Primary school student approached by vape peddlers at Dover Road; school alerts authorities

Water supply issues during Toa Payoh blaze affected firefighting operations; SCDF investigating

Entertainment

Entertainment

Gossip mill: Seventeen's Mingyu in Singapore for event, Babymonster's Chiquita receives hate presumably over Thai nationality, Jeon Somi recounts long chat with ghost

Blackpink's Rose has a Singapore pop-up where you can recreate APT music video and pick up merch

Cha Eun-woo's Memories VR concert: Become his 'girlfriend' in romantic fantasy show

Joanne Peh opens up about dealing with fame and controversies

E-Junkies: J-pop group Psychic Fever talk global goals and new EP

Blake Lively accused of harassment and intimidation

Miley Cyrus has special plans for Hannah Montana's 20th anniversary

Hulk Hogan secretly battled blood cancer before his death

Justin Timberlake diagnosed with 'relentlessly debilitating' Lyme disease

Pamela Anderson, reportedly dating Liam Neeson, says he puts her at ease during The Naked Gun filming

Lifestyle

Lifestyle

Bak kut teh ramen, laksa shakshuka and chilli crab burgers: Celebrate National Day with these exclusive SG60 meals

Japanese restaurant Umi Nami to shutter, in yet another F&B business closure at Holland Village

Uniqlo launching T-shirt collection in collab with Pokemon Trading Card Game

Second-generation owner of kueh tutu store Tan's Tu Tu Coconut Cake dies aged 63

Sierra Leone chimp refuge shuts doors to tourists to protest deforestation

I try 11 new Michelin Bib Gourmand 2025 eateries to see if they're worth the hype, here's my honest take

Michelin-starred restaurant Alma by Juan Amador to shutter in August, plans to reopen with new concept

Punggol Coast Hawker Centre just opened, look out for names like Singapore Fried Hokkien Mee and South Buona Vista Braised Duck

Premium Automobiles launches Chinese luxury EV brand Avatr in Singapore

Ka-Soh, heritage brand known for its fish soup, to shutter last outlet in Bukit Timah in September

Digicult

Digicult

Slim, sleek, but slightly too short-lived: Samsung Galaxy S25 Edge review

World's best Dota 2 teams to compete for $1m prize pool in Singapore in November

Sony RX1R III brings back the compact full-frame but not the Sony playbook

China's Premier Li proposes global AI co-operation organisation

'They don't gaslight you': Why some Singaporean women like to spend on these virtual men

Elon Musk's Starlink network suffers rare global outage

Spy cockroaches and AI robots: Germany plots the future of warfare

'Give a positive review': Hidden AI prompt found in academic paper by NUS researchers

'Report 1 shop, another 10 appear': Hoyo Fest artists on copyright struggles

NTU penalises 3 students over use of AI tools; they dispute university's findings

Money

Money

Up 4.3%: Singapore's economy grew in Q2 despite US tariff fears

Trump says US will set 15% tariff on South Korean imports under new deal

Cathay Cineplexes operator mm2 hires debt restructuring specialist as it faces more payment demands; CEO Chang Long Jong to retire

6 best travel insurance plans in Singapore (July 2025)

How to claim travel insurance? A comprehensive beginner's guide (2025)

Britain and India sign free trade pact during Modi visit

Long-time tech executive and Microsoft Singapore managing director Lee Hui Li dies

HDB launches 10,209 BTO and balance flats, as priority scheme for singles kick in

US-Philippines trade talks yield modest tariff shift after Trump-Marcos meeting

Indonesia to cut tariffs, non-tariff barriers in US trade deal

Latest

Latest

Daily roundup: Edwin Goh and Rachel Wan's wedding to be for next year — and other top stories today

Relief in Southeast Asia as Trump's tariffs level playing field

Flooding leaves 14 dead, missing in Vietnam's Dien Bien

Germany to respond to any unilateral Israeli moves on Palestinian territories, minister warns

Chongqing residents seek shelter as heatwave hits China's southwest

Indonesian President Prabowo pardons political opponents

'If the baby could speak, she would scream': The risky measures to feed small babies in Gaza

Trump administration refers Harvard to Justice Department in civil rights probe

Russia claims capture of Chasiv Yar after 16-month battle

In Case You Missed It

In Case You Missed It

Trump defers announcement of tariff rate on Malaysian goods to Aug 1, will attend Asean Summit in Oct: Anwar

Toddler wanders out of home in Selangor, mauled by stray dogs

'I was embarrassed': Malaysian security guard in viral knockout by MMA coach 'thankful' he wasn't fired

Mid-air brawl erupts on AirAsia X flight from KL to Chengdu over loud conversation

Cat A COE prices remain unchanged in second bidding exercise for July 2025

Robber drops gun and misfires after failed clinic robbery in JB

$30 one-off cash handout, petrol price cut: Malaysian PM Anwar Ibrahim announces measures to tackle living costs

'I felt helpless': Female tourist claims she was sexually harassed by ice cream vendor in Turkey

Unhealthy air quality in Malaysia, NEA warns of potential transboundary haze

This website is best viewed using the latest versions of web browsers.