Today in "LLMs can't do even simple reasoning":
Prompt: Sally (a girl) has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?
See a whole bunch of LLMs fail: https://benchmarks.llmonitor.com/sally
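For reference, the intended answer is 1. A minimal sketch of the reasoning, assuming all the siblings share the same parents (no half-siblings); the function name and parameters are made up for illustration and do not come from the benchmark:

```python
def sisters_of_sally(brothers: int, sisters_per_brother: int) -> int:
    """Count Sally's sisters, assuming one family with shared parents."""
    # Every brother has the same set of sisters, so the number of girls
    # in the family equals sisters_per_brother (Sally is one of them).
    girls_in_family = sisters_per_brother
    # The number of brothers is a distractor: it does not affect the count.
    # Sally's sisters are the girls in the family other than Sally herself.
    return girls_in_family - 1


print(sisters_of_sally(brothers=3, sisters_per_brother=2))  # 1
```

The usual wrong answers come from multiplying 3 by 2, or from forgetting that Sally herself is one of each brother's 2 sisters.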