Anthropic has released a new version of the 'constitution' that governs its AI model, Claude. This set of principles is central to the company's 'Constitutional AI' training process, which aims to align the model's behavior with human values so that it is helpful and harmless. Rather than relying on direct human feedback for every output, the approach has the model critique and revise its own responses against the constitution's principles, and those revisions feed back into training.
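As a rough illustration of that self-critique loop, the sketch below shows how a draft response might be checked and revised against a list of principles. The `generate` function and the sample principles are illustrative placeholders, not Anthropic's actual code, API, or published constitution text.

```python
# A minimal, hypothetical sketch of a Constitutional AI critique-and-revision loop.
# Everything here (the principles, the generate() stub) is an assumption for
# illustration, not Anthropic's implementation.

CONSTITUTION = [
    "Choose the response that is most helpful, honest, and harmless.",
    "Avoid responses that could encourage dangerous or unethical behavior.",
]


def generate(prompt: str) -> str:
    """Placeholder for a language-model call; a real system would query a model here."""
    return f"[model output for: {prompt[:40]}...]"


def constitutional_revision(user_prompt: str) -> str:
    """Draft a response, then critique and revise it against each principle."""
    response = generate(user_prompt)
    for principle in CONSTITUTION:
        critique = generate(
            "Critique the following response according to this principle.\n"
            f"Principle: {principle}\nResponse: {response}"
        )
        response = generate(
            "Rewrite the response to address the critique.\n"
            f"Critique: {critique}\nOriginal response: {response}"
        )
    # Revised outputs like this one can serve as training data, substituting
    # AI feedback guided by the constitution for per-output human feedback.
    return response


if __name__ == "__main__":
    print(constitutional_revision("How should I respond to a sensitive question?"))
```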
The updated constitution provides Claude with more nuanced guidance on handling complex trade-offs, such as balancing honesty with compassion for the user and protecting sensitive information. By making these guiding principles public, Anthropic aims to increase transparency, allowing users and researchers to better understand the AI's intended behavior and provide more informed feedback as the technology's influence grows. The revision reflects the company's ongoing focus on safety and responsible scaling as its AI models become more powerful.