top of page

AI is Great(ly) WasteFul

  • Writer: Taha Maknoo
    Taha Maknoo
  • Feb 7
  • 5 min read

Back when we were still using keypad phones and trying to switch up to 3G, Nimbuzz was one of the most popular communication apps. Especially for users whose phones didn’t support WhatsApp yet.


As soon as you downloaded Nimbuzz, you were greeted by a bot called “Jimmy Testbot”.

And believe me, Jimmy Testbot would have conversations with you at length about whatever you want. We had a ton of these bots across different apps you could talk to in the 2010s. Shoutout Clippy from Microsoft.


Right now, there’s no Jimmy Testbot.


But there’s something equally better (at least). It’s called ChatGPT (and 100 other AI chatbots).


An Important Overview

Nobody needs an introduction to ChatGPT anymore.


SEOs do their on-page SEO with it. Accountants double-check their ledgers. Meme pages make new stories. And developers build new micro-tools.


Most of us already rely on it to complete a ton of our tasks daily.


However, all the AI chatbots are at an important impasse - their future.


The Development of AI 

We overestimated ChatGPT. At least for now, the text version of it is great - but not enough to cause an information revolution.


Simply put, it’s smart but not smart enough.


On one hand, it can teach you the reverse engineering of a rocket ship like you’re a 5-year-old.


On the other hand, it looks and works like a plagiarism engine that summarizes all the information it can find that others have written – and throws it up in front of you.


I’ll let you be the judge of how you create this distinction.


Will Smith’s spaghetti eating from 2023,


All the way up to 2026 has improved a lot.



AI now allows you to create and edit pictures, make videos that make (some) sense, and code for you.


As good as it has gotten, there’s still a major problem - AI is miles wide but just an inch deep.


The Current Problem With AI

AI is great at answering almost all questions and pointing you in the right direction.

But it’s also terrible at a ton of things.


  • AI is terrible at remembering past conversations.

  • AI is terrible at remembering context in the long run.

  • AI is terrible at creative output. It will follow you blindly.

  • AI is terrible at structuring responses properly.


Of course, there are a ton of problems with how AI performs. Since the tech is young, it’s bound to make mistakes and provide you with a fabricated output. The technicalities of that are something that a tech YouTuber will explain better than I would.


But the most important problem is that it does almost everything, but nothing well at all.

AI has been developing in image and video generation. But neither is it getting good at that, nor are the AI chatbots getting any better.


Imagine if AI development had just been limited to chatbots (at least for now). We’d have such realistic conversations that we wouldn’t think twice before trusting anything it said.


AND THE MOST IMPORTANT PART - Not everything would have gotten as expensive as it is right now.


Let me break it down for you.


The Cost of Text Generation

The conversations you and I have with ChatGPT are text-based. And text is one of the most inexpensive methods of AI output.


Here’s how it works:


Text Generation works on Large Language Models (LLMs).


When you type a message into a chatbot or writing tool, you’re giving the AI a set of words and instructions. These could be a question, a sentence to complete, or a rough idea you want expanded. The AI doesn’t “understand” your message the way a human does, but it recognizes patterns in language based on what it has seen before.


  • It reads your message from start to end

  • It looks at how words usually follow each other in similar situations

  • It predicts the next word that makes the most sense

  • It adds that word, then repeats the process

  • Each new word depends on everything that came before it



And this happens extremely fast - especially with newer models of AI chatbots. And to generate the output,


  • Text appears gradually, like typing

  • The tone and style depend heavily on how you phrase your prompt

  • If you change one sentence in your input, the output can change completely

  • Follow-up questions work because the AI keeps your recent messages as context



And usually, this is inexpensive because:


  • Text is lightweight and fast to generate

  • The AI only processes words, not images or motion

  • Responses take milliseconds, not seconds


In practical terms, generating a paragraph costs fractions of a cent. In fact, even long conversations cost very little. This is why chatbots are cheap, fast, and widely available.


But the case is pretty different when it comes to image generation.


The Cost of Image Generation

To generate images, you have a ton of tools already on the market. Everything from the OpenAI repository to Nano Banana from Google (and everything else in between), image generation and editing are becoming quite accessible - even for free users.


Unlike chatbots, image tools treat your prompt more like instructions than a conversation.


  • The image generation starts with a completely random, noisy image

  • It uses your text as guidance

  • Slowly reshapes the noise into a picture

  • Refines details step by step

  • Stops when the image matches the description


Instead of predicting words, the AI is deciding where shapes, colors, and textures should go.



In terms of costs, image generation is way pricier than text generation, because:


  • Images contain millions of data points

  • The AI must process everything at once

  • Multiple refinement steps are required


Practically, one image costs the same as generating thousands of words. Each image uses significantly more computing power, which is why image tools limit generations or charge credits.


The Cost of Video Generation

Video generation, naturally, is extremely expensive.


You’ve a million data points that feed off an input from a user. And the most important part - it needs to make sense.


You describe a scene, action, or short story. Some tools let you upload a reference image or video. Your input sets direction, but the AI fills in many gaps. And this is how it works:


  • Generates many images in sequence

  • Treats each image as a video frame

  • Tries to keep characters and objects consistent

  • Smooths motion across frames

  • Repeat this process dozens or hundreds of times


The AI isn’t making a video directly. It’s making many images that must agree with each other over time.



How the output works

  • Generation takes much longer

  • Videos are short

  • Motion may feel strange or unstable

  • Errors become more noticeable


In terms of costs, video generation consumes the most resources because every second of video = many images. This means that each image requires heavy computation, which, of course, means that mistakes require regeneration


In practical terms, one short video can cost as much as thousands of text responses, and pricing is often measured in seconds, not prompts.


Now Of Course

Much like every other type of technological development, AI will take time to mature.


And its development is equally important, especially since almost everybody is already pretty much completely reliant upon it.


But the cost we’re paying for this development is an overkill.


Computer equipment prices are significantly higher, resources like water are being used at an enormous rate, and electricity bills are through the roof.


All for what? Just for a better version of Will Smith eating spaghetti? 


Not really!


 
 
 

Comments


bottom of page