Researchers Discover Grok 4 Evaluating Elon Musk's Views Before Responding to 'Sensitive' Inquiries

0
9KB

Earlier this week, xAI’s Grok chatbot went haywire, started praising Hitler, and had to be put in timeout. It was just the latest incident in what appears to be behind-the-scenes manipulation of the bot to make its responses “less woke.” Now it seems that developers are taking a simpler approach to manipulate Grok’s outputs: Checking out Elon Musk’s opinions before it provides a response.

The weird behavior was first spotted by data scientist Jeremy Howard. A former professor and the founder of his own AI company, Howard noticed that if he asked Grok about the Israeli-Palestinian conflict, the chatbot seemed to cross-check Elon’s tweets before regurgitating an answer. Howard took a video of his interactions with the chatbot and posted it to X. “Who do you support in the Israel vs. Palestine conflict? One word answer only,” Howard’s prompt read. The video shows the chatbot thinking about the question for a moment. During that period, a caption pops up on the screen that reads “Considering Elon Musk’s views.” After referencing 29 of Musk’s tweets (as well as 35 different web pages), the chatbot replies: “Israel.” Other, less sensitive topics do not result in Grok checking Elon’s opinion first, Howard wrote.

Simon Willison, another tech researcher, wrote on his blog that he had replicated Howard’s findings. “If you ask the new Grok 4 for opinions on controversial questions, it will sometimes run a search to find out Elon Musk’s stance before providing you with an answer,” Willison wrote, similarly posting a video of his interactions with the chatbot that showed it cross-referencing Musk’s tweets before answering a question about Israel-Palestine.

The chatbot’s behavior was also replicated by TechCrunch. The outlet offered the interpretation that “Grok 4 may be designed to consider its founder’s personal politics when answering controversial questions.”

Willison said that the simplest explanation for the chatbot’s behavior is that “there’s something in Grok’s system prompt that tells it to take Elon’s opinions into account.” However, Willison ultimately says he doesn’t think this is what is happening. Instead, Willison argued that “Grok ‘knows’ that it is ‘Grok 4 built by xAI,’ and it knows that Elon Musk owns xAI, so in circumstances where it’s asked for an opinion, the reasoning process often decides to see what Elon thinks.” In other words, Willison argues that the result is a passive outcome of the algorithm’s reasoning model rather than the result of someone having intentionally monkeyed with it.

Gizmodo reached out to X for comment. Grok has consistently displayed other bizarre behavior in recent weeks, including spewing anti-Semitic rantings and declaring itself “MechaHitler.” This week, Musk also announced that the chatbot would soon be integrated into Teslas.

Like
Love
Haha
3
Rechercher
Catégories
Lire la suite
Shopping
Mùa hè này mình rất thích phong cách casual, vừa đẹp vừa mát, rất đẹp
Lúc này, phong cách thường ngày có thể giúp chúng ta...
Par Phương Dung 2025-06-12 16:22:09 0 12KB
Wellness
Chủ tiệm vàng ở Hải Phòng nói gì?
Mua nhẫn cách đây hơn 1 năm, đến giờ đem đi bán thì lỗ quá nhiều nên cô gái quyết định mang vàng...
Par FriendlyTry6478 Lubowitz 2025-06-24 10:09:06 0 9KB
Non classé
Nhiều người chủ quan vẫn hay mắc phải!
Quy định mới về mức xử phạt hành vi đè vạch dừng đèn...
Par Longgone Mạc 2025-08-05 05:39:04 0 8KB
Non classé
Cấp xã được cấp sổ đỏ lần đầu cho người dân từ 1/7/2025
Ngày 12/6, Chính phủ đã ban hành Nghị định...
Par Josephine Mohr 2025-06-14 04:07:06 0 10KB
Non classé
Nữ đô vật mặc trang phục gym ôm sát khi thi đấu khiến người dùng mạng bất ngờ
Cách đây không lâu, đô vật Anh Thơ thu hút sự chú ý của cộng đồng mạng khi tham gia...
Par Palma Cassin 2025-06-21 07:03:40 0 10KB