4 Comments
User's avatar
Peter Luh's avatar

Anyone curious about automatically pulling together query responses from multiple chatbots might want to explore Alan's open-source Botwell project. I’ve found it really eye-opening and worth a look at https://github.com/alanwilhelm/botwell/tree/main

Expand full comment
Alan Wilhelm's avatar

Peter's Boswell test in action: https://news.ycombinator.com/item?id=43196405

Expand full comment
Alan Wilhelm's avatar

Hello Peter! This is one of my favorite ongoing performance benchmarks. It is very code focused but that's my main use case. https://aider.chat/2024/12/21/polyglot.html#the-polyglot-benchmark

Expand full comment
Peter Luh's avatar

Thanks Alan for your automating chatbot queries and have them grade each other, paving the way for a Boswell Test that leaves writers feeling "lost without their trusty AI writing companion."

Expand full comment