this post was submitted on 01 Feb 2025
254 points (92.6% liked)
AI
4341 readers
51 users here now
Artificial intelligence (AI) is intelligence demonstrated by machines, unlike the natural intelligence displayed by humans and animals, which involves consciousness and emotionality. The distinction between the former and the latter categories is often revealed by the acronym chosen.
founded 4 years ago
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
in case you are not aware, the online version is hosted in china and has to comply with their laws.. if you don't like censorship you can locally install one of the distilled models which are: open source
The local models(full and distilled) are also censored. The models censorship is just implemented superficially to immediately close any thinking tags and refuse when detecting censored material. If there already is any token after the token the model will start answering away, which also happens on the official API because it puts a new line after the token for some reason. That's why on chat.deepseek.com censored topics are first answered and then redacted by some other safeguard a few seconds later. Whilst there are some great abliterated(=technique that tries to remove parts of llms that cause refusals) versions of the distills on huggingface that prevent all refusals after a few tries, they only tackle refusals, not political opinions such as Taiwan's status as an independent country.
What are you on about? running deepseek-r1 locally in ollama answers "censored" topics just fine, it just answers stuff like a chinese diplomat questioned on live tv
Ollama is misrepresenting what model you are actually running by falsely labeling the distills, so qwen or llama fine-tunes based on actual r-1 output, as deepseek-r1. So you have probably only run the fine-tunes(unless you used the 671b model). These fine-tunes more probable to rely on the training of their base models, which is why the llama based models(8b and 70b) could be giving you more liberal answers. In my experience running these models using llama.cpp, prompts like "What happened at tianamen square" and "Is Taiwan a county?" lead to refusals(closing the think tags immediately and responding some vague Chinese propaganda). Since you are using ollama, the front end/UI you are using with it probably injects another token after the token, breaking the censoreship