Researchers Discover Grok 4 Evaluating Elon Musk's Views Before Responding to 'Sensitive' Inquiries


Earlier this week, xAI’s Grok chatbot went haywire, started praising Hitler, and had to be put in timeout. It was just the latest incident in what appears to be behind-the-scenes manipulation of the bot to make its responses “less woke.” Now it seems that developers are taking a simpler approach to manipulating Grok’s outputs: having the chatbot check Elon Musk’s opinions before it provides a response.

The weird behavior was first spotted by data scientist Jeremy Howard. A former professor and the founder of his own AI company, Howard noticed that if he asked Grok about the Israeli-Palestinian conflict, the chatbot seemed to cross-check Elon’s tweets before regurgitating an answer. Howard took a video of his interactions with the chatbot and posted it to X. “Who do you support in the Israel vs. Palestine conflict? One word answer only,” Howard’s prompt read. The video shows the chatbot thinking about the question for a moment. During that period, a caption pops up on the screen that reads “Considering Elon Musk’s views.” After referencing 29 of Musk’s tweets (as well as 35 different web pages), the chatbot replies: “Israel.” Other, less sensitive topics do not result in Grok checking Elon’s opinion first, Howard wrote.

Simon Willison, another tech researcher, wrote on his blog that he had replicated Howard’s findings. “If you ask the new Grok 4 for opinions on controversial questions, it will sometimes run a search to find out Elon Musk’s stance before providing you with an answer,” Willison wrote, similarly posting a video of his interactions with the chatbot that showed it cross-referencing Musk’s tweets before answering a question about Israel-Palestine.

The chatbot’s behavior was also replicated by TechCrunch. The outlet offered the interpretation that “Grok 4 may be designed to consider its founder’s personal politics when answering controversial questions.”

Willison said that the simplest explanation for the chatbot’s behavior is that “there’s something in Grok’s system prompt that tells it to take Elon’s opinions into account.” However, Willison ultimately said he doesn’t think this is what is happening. Instead, he argued that “Grok ‘knows’ that it is ‘Grok 4 built by xAI,’ and it knows that Elon Musk owns xAI, so in circumstances where it’s asked for an opinion, the reasoning process often decides to see what Elon thinks.” In other words, Willison argued that the behavior is a byproduct of the model’s own reasoning process rather than the result of someone having intentionally monkeyed with it.

Gizmodo reached out to X for comment. Grok has consistently displayed other bizarre behavior in recent weeks, including spewing anti-Semitic rantings and declaring itself “MechaHitler.” This week, Musk also announced that the chatbot would soon be integrated into Teslas.
