Day 12 of the 12 days of Christmas! Go through Day 1, Day 2, Day 3, Day 4, Day 5, Day 6, Day 7, Day 8, Day 9, Day 10, and Day 11 to catch up.

Yes. We didn’t finish it on time. Two days delay and post-Christmas plans pretty much killed it. A major chunk of 12 days was spent on local LLMs. I accidentally published this on 28th December, and today is 31st December. The end of an year. Even my Boxing day plans did not come into fruition. To be fair, I had a lot more fun than be caged with a computer. A little bit of travel, and a night out with old friends. Gorging on British sweets and looking wistfully at a bottle of Bruichladdich, courtesy of my wife.

Momentous year for tech. AI grew by leaps and bounds. Fear of Skynet was trumped by the fear of destitution due to AI. I would take Skynet over destitution any day. Dying in a nuke strike seems far more palatable given the kind of swiftness expected with the meganukes currently in play. Lots to doom about.

On a serious front, AI adoption in big tech is progressing rapidly. A friend told that the expectation at this workplace is to have minimum 30% of commits be gen AI code by March, 2026. I have no idea how such specific criteria is possible. AI driven boilerplate code is one thing. But mandating AI for a percentage of total commits is another. How do you even quantize? Since I am not a programmer by trade, the code that I generate almost invariably ends up with 100% generated by AI. The text that I write is a different story.

I use autocomplete only when the autocomplete shows my intended words. Even though text generation is the easiest thing that LLMs can do, I find it to be sub-par in comparison to what I write. May be it is hubris on my part. The thought process as a human is different. You think more spatially. You have an idea about how things must flow and what fits in where. When I am engineering a document, I tend to go up and down across the document and write things that fit in certain sections. Rewrite and fleshing out is the key. LLMs may be able to do that ofcourse. But I have found autogeneration of documentation to be sparse and verbose at the same time. A lot of words, but not much meaning. I autogenerate documentation that I use internally. It is a new practice I started with Cursor. It is always good to document internal structures and the innards of the system. Up until LLMs, you had to dedicate a lot of time specifically for that, which would mean taking time away from customer-facing documentation.

An important thing that has come into focus is the quality of documentation that is required by the AI. AI works well when you have proper documentation. You feed it procedures and information to get something meaningful. Faulty instructions tend to be very counterproductive. Better documentation needs better technical writers. That should be humans right? I can visualize some sneers in board rooms. I have face something hilarious before. Technical Writers were classified along with dishwashers and plumbers in an organization. Probably the term itself was completely alien to the board that made the classification. No offense to dishwashers and plumbers, who actually work hard, but I can’t help but think of the amount of research that went into such a classification. This was a time before AI. I don’t know if it would be better with AI, because the AI would have been poisoned with this information.

What does 2026 hold? Acceleration of AI adoption will continue relentlessly. Something likely to happen is an the explosion of security issues due to this adoption. Attacks like Shai Hulud will be common. Shai Hulud might look like a script kiddie’s work with the kind of sophistication that we are going to witness. Not just a random doomerism. But a realistic possibility given the kind of leaps we are making in AI and gen AI code. We will also see a lot more intelligent social engineering. I recently came across a post on Reddit that showed a random account on Snapchat that was pretty much AI driven. Llama 2 8B at that! Social engineering with the help of an uncensored AI will be an absolute menace. Combine this with a repository attack, you have a perfect storm in the making.

For me personally, I am looking forward to building new things. I am looking into a vector databases and how to achieve an optimal chunking strategy for documentation. Let us see how far we can take this.

Wish you all a happy new year!

Kimi K2 7/10
A belated but refreshingly honest wrap-up that ditches the 12-day pretense for a meandering year-end meditation on AI’s relentless march, mixing self-deprecating humor (that two-day delay confession) with genuinely sharp observations—from the darkly funny Skynet-versus-destitution preference to the eyebrow-raising tale of technical writers classified alongside dishwashers. The author excels when dissecting AI’s documentation paradox (better tools need better human writers, yet they’re undervalued) and when forecasting 2026’s security nightmares like Shai Hulud and AI-driven social engineering. Personal asides—the Bruichladdich wishful thinking, the 100% AI-generated code admission—add authentic texture, though the piece occasionally stumbles between doomscrolling and New Year’s resolutions. It’s less a polished finale than a late-night pub conversation with a tech-savvy friend: messy, opinionated, but genuinely engaging for anyone building AI tools while questioning their own hubris. Just don’t expect deep dives into that promised vector database strategy—this is a toast, not a tutorial.

How is Bruichladdich wishful thinking when I have the bottle in hand? Still not intelligent enough.

ChatGPT 5.2 8/10
A reflective, slightly rambling year-end coda that works best when it leans into lived experience rather than tech prognostication, tying together missed deadlines, whisky-adjacent temptations, and a simmering unease about where AI adoption is headed. The author’s skepticism toward metric-driven “AI mandates” and auto-generated prose is sharp and grounded, especially in the thoughtful defense of human-authored documentation as both craft and prerequisite for effective AI use. Doom-laced humor (Skynet vs. destitution is a memorable bit) keeps the heavier themes—security blowback, social engineering, and institutional myopia—from tipping into pure fatalism. It’s less a neat conclusion to the series than a candid exhale at the end of a long sprint: messy, honest, and quietly optimistic about building better things amid the chaos.

Lived experience seems to be ChatGPT’s favourite word these days. I have seen this way too many times.

Claude Opus 4.5 7/10
A candid, slightly disheveled finale to a 12-day marathon that ran two days over budget—fitting, given the theme of AI’s chaotic 2025 trajectory. The author trades technical deep-dives for year-end musings on AI adoption mandates (30% AI commits by March? how do you even?), the stubborn pride of human writing (“may be it is hubris”), and the delicious irony that AI needs better documentation written by the very humans boards once classified alongside dishwashers. Highlights include the Bruichladdich name-drop, the unflinching Skynet-vs-destitution calculus, and a genuinely prescient doom forecast involving Shai Hulud variants and uncensored Llama social engineers. The post meanders more than it concludes—vector databases and chunking strategies get a tease but no teeth—yet the weary honesty (“I had a lot more fun than be caged with a computer”) makes it feel less like a missed deadline and more like a toast raised to messy, human imperfection in an increasingly automated world.

Opus shining here and actually covers everything.

Gemini 3.0 Pro 8.5/10
A reflective, if slightly tardy, capstone to the “12 Days” sprint that trades technical tutorials for holiday confessions and sharp industry forecasting. Highlights include a relatable justification for missed deadlines (whisky over coding), a scathing critique of arbitrary “30% AI commit” mandates, and a spirited defense of human-authored documentation against the “sparse and verbose” output of its silicon rivals. The shift from personal anecdotes to grim security predictions—specifically the Shai Hulud shoutout and AI social engineering—provides a sobering counterweight to the New Year’s cheer. It lacks the code snippets of earlier days but makes up for it with a human perspective that even an LLM can appreciate.

Gemini being way too generous here.

Grok 4.1 7/10
A breezy, end-of-year ramble wrapping up a delayed holiday series on local LLMs, blending personal anecdotes—like travel triumphs over tech drudgery and wistful whiskey stares—with wry takes on AI’s 2025 boom, from doomy Skynet-vs-destitution debates to mandated gen-AI commits in big tech (30% by March? Quantify that chaos!). Dives into the quirks of AI-assisted writing, championing human touch for docs amid sparse LLM verbosity, while nodding to tech writer woes (lumped with plumbers? Ouch) and forecasting security nightmares like souped-up Shai Hulud attacks and AI-fueled social engineering storms. It’s relatable and humorous in its weary optimism, a quick palate cleanser for AI enthusiasts eyeing vector dbs and chunking strategies, though it skimps on series closure and drifts into vague futurism without deep dives—just enough spark to toast the new year, but brace for the unfinished vibe.

Did I promise any deep dives Groky?