I am U-Zyn Chua. I build, research and write about technology, AI and the open web.

A series of rabbit holes

My journey started casually enough—working on Tixie, I was looking into simpler ways to deploy Rust-based Lambda functions when I came across Cargo Lambda. What caught my attention was how easy cross-compilation is with it, thanks largely to Zig's powerful build system. Even though Cargo Lambda was built for Rust developers, it uses Zig for cross-compilation, which let me compile ARM binaries for AWS Graviton on my Apple Silicon Mac.

That was the start of a series of rabbit holes that consumed me for more than a week.

Rabbit hole #1: Zig

I had heard about Zig before but had not looked into it deeply. This brief encounter quickly turned into a deeper dive and an engaging exploration. The Zig community also seems to include a significant number of "Rust refugees".

Rust is extremely memory-safe, but many of its design choices can be rather annoying and unforgiving for developers. What annoys me the most, personally, is its error handling.

Notable projects such as Bun—a JavaScript runtime gaining popularity—are turning to Zig for its simplicity, clarity, and efficient build system. I spent more than a week learning and diving deep into Zig. I like it so far: it has a C-like syntax, it is a low-level language like Rust, and its language design choices are more straightforward than Rust's.

Rabbit hole #2: Jetzig

I am working on a project that I had started in Rust a while back.

As I got deeper into Zig, I stumbled upon Jetzig, a Model-View-Controller (MVC) web framework built on and for Zig. What particularly intrigued me about Jetzig was that it seems very well suited to HTMX and Hypermedia Systems, a development stack and philosophy I have gravitated towards lately in my goal of simplifying build processes.

I spent the next few days learning and experimenting with Zig and Jetzig.

Rabbit hole #3: Zmpl

Jetzig's built-in template language is Zmpl. However, as it is a rather new language, there was one problem: working without syntax highlighting felt uncomfortable.

No Zmpl syntax highlighting extension was available yet on the VS Code extension marketplace. Zmpl's page did mention one, but unfortunately I could not get it installed, and looking into it, it seemed to lack comprehensive definitions. I thought about fixing that by creating my own; how hard could it be?

Rabbit hole #4: tmLanguage and VS Code extension

First thing to learn: tmLanguage, a syntax grammar definition format originally created for TextMate. It has outlived TextMate and is still used by most popular code editors today, including Visual Studio Code, Cursor, Sublime Text, and Atom. Unfortunately, resources for tmLanguage were strangely limited. This blog post by Matt Neuburg was helpful in explaining the concepts. Interestingly, it was written more than 11 years ago, in February 2014.
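To get a feel for the format, here is a minimal, hypothetical tmLanguage grammar in the JSON flavor VS Code accepts. The structure (scopeName, patterns, repository, begin/end rules) is standard tmLanguage; the Zmpl-specific tokens are illustrative, not taken from my actual extension:

{
  "scopeName": "source.zmpl",
  "patterns": [
    { "include": "#line_comment" },
    {
      "comment": "Illustrative mode block: switch into embedded Zig until a closing brace",
      "begin": "@zig\\s*\\{",
      "end": "\\}",
      "name": "meta.embedded.block.zig",
      "patterns": [{ "include": "source.zig" }]
    }
  ],
  "repository": {
    "line_comment": {
      "match": "//.*$",
      "name": "comment.line.double-slash.zmpl"
    }
  }
}

Note how a naive end pattern like this cannot track nested braces; that limitation is exactly where multi-level nested mode switches get painful.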

A few days of intense development, testing, and refining finally produced something functional. While it wasn't perfect, especially for multi-level nested mode switches, it was a significant improvement over existing solutions and met my needs for now.

I released my Zmpl syntax highlighting extension on the VS Code Marketplace and open sourced it.
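For anyone curious, wiring a grammar into a VS Code extension is mostly declarative. A minimal sketch of the relevant package.json contribution points (the language id and file paths here are illustrative, not necessarily those of my extension):

{
  "contributes": {
    "languages": [{ "id": "zmpl", "extensions": [".zmpl"] }],
    "grammars": [
      {
        "language": "zmpl",
        "scopeName": "source.zmpl",
        "path": "./syntaxes/zmpl.tmLanguage.json"
      }
    ]
  }
}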

Preview in Monokai theme:


What started as a small personal inconvenience turned into multiple rabbit holes that consumed me for more than a week, but I emerged with an appreciation for Zig and a working knowledge of tmLanguage and the VS Code extension ecosystem.

Now, time to get back to my original project.

The Roadmap Pitfall: Companies Should Not Sell Tomorrow's Features Today


If you run a company, do not release roadmaps or make future promises. You owe your customers nothing beyond what they're paying for right now. Your customers pay for what you sell today, not what you will be selling tomorrow.

I call this the Roadmap Pitfall. It is a common mistake and trap that many companies, especially startups, fall into. The tech landscape is littered with companies that have overpromised and underdelivered through their public roadmaps, hurting their reputations. Some infamous examples: Tesla's Robotaxi, Meta's Libra and Metaverse, and Google Stadia. They were selling products that might have become revolutionary tomorrow, but not today.

Apple


The latest victim is, surprisingly, Apple – yes, the world's largest company. Apple historically not only avoided publishing roadmaps but went to extraordinary lengths to keep product leaks and rumors at bay. In 2005, Apple sued people who leaked product details online, and it famously forced the shutdown of Think Secret, a prominent rumor website, as part of a settlement in 2007.

Yet they've now fallen into the Roadmap Pitfall with Apple Intelligence – announcing features before they were anywhere close to ready and setting expectations they couldn't meet. Apple has started removing promised features from its iPhone 16 and Apple Intelligence pages.

In a statement last week, Apple confirmed that it needed more time to finish developing the new Siri features. Apple said it aimed to begin rolling out the features "in the coming year," but it did not provide a more specific timeframe. (from MacRumors)

I agree with the widespread criticism of how badly Apple Intelligence flopped, not only regarding delays but also regarding the features that were actually released. Notification summaries were so problematic that Apple had to disable them for news applications after they generated inaccurate content. Writing Tools are hardly usable in their current state, and Image Playground was fun for approximately 30 minutes – the total time I've spent on it – before my iPhone heated up significantly.

I still think the idea of Apple Intelligence – AI that lives and runs securely on your device – could be a massive life-changer. If I were to bet on one company to deliver "AI in your pocket" well, with great UX and without compromising security, it would be Apple. My issue isn't with the vision but with the over-promising of features that were nowhere close to ready. In fact, I would have been perfectly fine with an iPhone 16 Pro with no AI. Let the competitors play the feature one-upping game with their gimmicky flip phones and touchscreen laptops.

Delight your customers

Roadmaps and projections are for investors, who might pay a premium in your company's valuation, counting on you being able to deliver the promised goals. Customers are not investors – they buy your current products and services, not your future promises.

The next time you see a company selling dreams and futures, with "coming soon" banners plastered across its website, proceed with caution. There are often significant problems beneath those promises, forcing the company to sell what-could-have-beens rather than its products. If you're running a company, remember: it's better to surprise and delight customers with unexpected new features than to disappoint them with promises that never materialize.

Introducing Tixie

Tixie - your spelling companion

Spelling, or 听写 (dictation), is a very familiar routine for Singaporean parents with primary-school-going children – or, in fact, for parents worldwide. Every week, each of our children has to learn two sets of words and phrases, one in English and one in Chinese, to be tested on in school. Each set consists of 15-20 phrases and sentences.

Our usual process is to first have each child study the words or phrases for the upcoming spelling test on their own, and once they are ready, pass us parents the list to read aloud to test them. I'm sure many parents in Singapore are very familiar with this. Rinse and repeat every week.

I created Tixie to facilitate the latter read-aloud process, allowing students to conduct it solo, without involving parents or another person.

Try it out at https://tixie.org. It is free to use!

Features of Tixie

Tixie's user experience consists of:

  1. Creating a spelling list.
  2. Having the list read back, one line at a time, optionally in random order.
  3. Showing the final list in the order it was read out, so students can mark their own work.

Tixie code

To save time from having to recreate the same list whenever a child would like to retake a test, Tixie automatically saves the list and provides a 9-digit Tixie code, so the test can be retaken later or shared with peers. Try out the code above.

Teachers can also pre-create a list and share the saved Tixie code with students in class for self-practice at home.
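Purely as an illustration (this is not Tixie's actual source), generating such a code could be as simple as:

// Hypothetical sketch: generate a random 9-digit code to key a saved list.
function generateTixieCode() {
  // Random integer in [100000000, 999999999], returned as a string.
  return String(Math.floor(100_000_000 + Math.random() * 900_000_000));
}

const code = generateTixieCode(); // e.g. "482913756"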

Mascot 🧚


Tixie is a wordplay on tīngxiě (听写) and rhymes with pixie 🧚. The logo is in vector form, designed and created by my 8-year-old daughter, Melinda, on Scratch (check out the source!).

She publishes her Scratch projects from time to time; give her a follow if you think her work is cool.

Serverless technology stack

Tixie is entirely serverless as I do not want to maintain servers and infrastructure if I can avoid doing so.

Frontend:

  • Pure HTML
  • Vanilla JS
  • Tailwind CSS
  • Served statically via AWS CloudFront CDN

I am not a big fan of monstrous and complicated build steps, so I deliberately chose not to use React, JSX, or any sophisticated JavaScript libraries. Vibe coding via Claude Code does help in managing a vanilla HTML and JS codebase without over-complicating things.

Backend:

The backend is solely for saving and retrieving saved spelling tests; serverless architecture fits such requirements perfectly.

While I am not new to Lambda, this is my first time using it with Rust; my past usage has been via Node.js. Cargo Lambda helps with local testing and, most importantly, supports cross-compilation for ARM processors on my macOS laptop – allowing the final production code to run on AWS Graviton, an efficient ARM-based processor, with a very small memory footprint.

The compiled code runs on AWS Lambda without requiring a managed runtime, which Node.js scripts would need. That helps significantly with the slow cold starts often criticized in Lambda functions. Tixie's API is lightning fast and yet very light!

No artificial intelligence

While it is not meant to be a feature, it is crucial to point out that the current version of Tixie contains no AI components.

Speech is generated entirely on the client device on which Tixie loads. This is why there is a language selection at the top: it allows mapping each language to commonly available voices on the device.

const shortlistedVoices = {
  en: ["Samantha", "Reed", "Flo", "Eddy", "Sandy"],
  zh: ["Tingting", "Ting-Ting"],
  ms: ["Amira"],
};

I have only tested these on some of the devices I have access to; if you would like to add other voices or languages to the shortlist, please drop me a note.
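As a minimal sketch (function names are mine, not Tixie's actual source), applying the shortlist with the browser's built-in Web Speech API could look like this:

// Pick the first shortlisted voice available on this device,
// falling back to any voice matching the selected language.
// Note: getVoices() may return [] until the voiceschanged event fires.
function pickVoice(lang) {
  const voices = window.speechSynthesis.getVoices();
  const preferred = shortlistedVoices[lang] || [];
  return (
    voices.find((v) => preferred.includes(v.name)) ||
    voices.find((v) => v.lang.startsWith(lang))
  );
}

function speak(text, lang) {
  const utterance = new SpeechSynthesisUtterance(text);
  utterance.voice = pickVoice(lang) || null; // null falls back to the default voice
  window.speechSynthesis.speak(utterance);
}

speak("听写", "zh");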

The project did not start out without AI. The early prototypes/experiments included some AI components, which made them somewhat cost-prohibitive for a free app:

  • Voice-only use, via a real-time voice AI API. Users could say "Next," "Repeat the word," or something similar in another language to control the flow entirely by voice, leaving the device face down while taking the test.
  • Automatic test setting. A user just had to take a photo of the test sheet given by the school, and the test would be automatically populated.
  • Automatic marking. At the completion of a spelling test, the user could snap a picture and have it automatically marked. This involved prompt engineering and some fine-tuning, especially for Chinese handwriting recognition, and being a little more forgiving of the users' handwriting, as they are often kids below the age of 10.

Early prototype

I made an early prototype. You can find it at https://uzyn.github.io/tixie/ while stocks last. It took only a few hours to make.

It was done without a server-side component, with the list saveable entirely in URL parameters. My children used it for a few weeks before Tixie.org was ready.

Key takeaways with URL parameters for real-world use:

  • Apple's iMessage strips away the URL fragment, the part after # in a URL.
  • Encoding and decoding gotchas for non-ASCII characters (see the sketch below).
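As a quick illustration of the second gotcha (a sketch, not the prototype's actual code; the parameter name is made up), non-ASCII phrases must be percent-encoded into the URL and decoded symmetrically on load:

// Store the list in a query parameter rather than the #fragment,
// since iMessage may strip the fragment.
const list = ["听写", "spelling"];
const url = "https://example.com/?list=" + encodeURIComponent(list.join("\n"));

// On load: URLSearchParams decodes percent-escapes automatically;
// hand-rolled parsing must call decodeURIComponent, or 听写 comes back garbled.
const params = new URLSearchParams(window.location.search);
const restored = (params.get("list") || "").split("\n");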

Enjoy

Let me know if you have any feedback, suggestions, bug reports or simply want to say hi.

Have fun with Tixie! https://tixie.org/

Digg is coming back


Digg is coming back!

Digg was one of the very first social news aggregators, back in 2005 or so. It was one of the inspirations that led me to create and launch Ping.sg, a social blog aggregator mainly for Singapore blogs, in 2006.

To me, this is a sign of the revival of a free and open web, independent of big tech.

I'm happy to see it. It is a glimpse of validation for a few things I am working on in the open web space.

I like what my friend Ridz calls it: Make the web personal again.

KahWee's AI-generated blogging engine

Also check out my friend KahWee's new blog. It is not built on any blogging engine or off-the-shelf static site generator. It is built entirely with Claude Code!

Hidden messages from OpenAI GPT 4.5 announcement

Watch the OpenAI GPT 4.5 announcement video.

Messages from the announcement, some obvious, some not so:

  1. GPT 4.5 is going to be awesome.

  2. We got a glimpse of GPT 1 (08:34), circa 2018. Spoiler alert: it sucked.

  3. The engineering team at OpenAI is just as awkward in social settings as your company's engineering team.


  4. o3-mini has an 80% hallucination rate! That's shocking, and it explains why I have not had a good experience with o3-mini.

  5. Key hidden message: GPT 4.5 is trained aggressively at low precision (07:40), a technique popularized recently by DeepSeek.

Low-precision training

This could normalize low-precision training going forward. I'm not sure yet whether this is good or bad in the long term: if competitive pressure keeps everyone training at low precision, we may never know what we might have gained by training at high precision with improved techniques and compute resources.

Before DeepSeek, training was often done at high precision, requiring more resources. Precision was usually reduced (quantized) only at testing (inference or usage) time to reduce compute requirements for everyday use.
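As a toy sketch of what quantization means (illustrative only; real systems use per-channel scales, FP8 formats, and far more care):

// Map float weights to 8-bit integers and back; precision is traded
// for a roughly 4x smaller memory footprint.
function quantizeInt8(weights) {
  const scale = Math.max(...weights.map((w) => Math.abs(w))) / 127;
  return { q: weights.map((w) => Math.round(w / scale)), scale };
}

function dequantize({ q, scale }) {
  return q.map((v) => v * scale);
}

const packed = quantizeInt8([0.12, -0.5, 0.031]);
console.log(dequantize(packed)); // close to the originals, with small rounding error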

It is also important to note that while DeepSeek demonstrated that low-precision training can yield effective results, DeepSeek's team did not invent low-precision training.

Claude Code is impressive

My workflow with generative AI has been changing quite rapidly over the past two weeks. DeepSeek's release really sped up innovation and release cycles among its competitors.

I may draft a longer note covering my workflow next time; I'm writing a quick one to share how impressive Claude Code is.

Claude Code hit a homerun

I have been trying, and failing, to get this straightforward task executed by GitHub Copilot, across a series of models and prompt tweaks. Here's the gist of the prompt:

Extract inline SVGs from index.html into individual SVG files and load them via <img> for better code and file management.

It seems relatively straightforward, but GitHub Copilot managed to strike out five times on it. Models I attempted include both Claude 3.7 Sonnet (Preview) and Claude 3.7 Sonnet Thinking (Preview).

GitHub Copilot would either screw up the alignment of the page or tweak it so much that it changed the page's Document Object Model (DOM) and broke its accompanying JavaScript.

I decided to check out Claude Code, which has been all the rage lately in the software engineering community. Lo and behold, it got it right on the first attempt! The DOM stayed fully intact, the layout remained perfect, and my JavaScript continued to work.

Claude Code's final output:

The SVGs have been successfully extracted from the HTML file and properly organized in the assets/icons directory. The index.html file has been updated to reference these SVG files using the <img> tag instead of embedding them directly in the HTML.

Here's a summary of the changes:

  1. Created an assets/icons directory for SVG files
  2. Extracted 8 SVG icons from the HTML into separate files:
  • mail.svg - Email icon
  • back.svg - Back arrow icon
  • forward.svg - Forward arrow icon
  • share.svg - Share icon
  • sound.svg - Sound icon
  • refresh.svg - Refresh/repeat icon
  • volume.svg - Volume icon
  • chevron-right.svg - Right chevron icon
  3. Updated the HTML file to use <img> tags to reference these SVG files
  4. Updated the JavaScript file to use <img> tags for dynamically generated content

This improves code and file management by:

  1. Reducing HTML file size
  2. Making SVG icons reusable across the application
  3. Enabling browser caching of SVG files
  4. Making it easier to update icons in the future
  5. Improving code readability and maintainability

The application should function identically to before but with a cleaner organization.

Per-use billing

Another thing I love about it is that it is billed through the Anthropic API rather than through Claude's subscription plan. It's a win-win for both parties: Anthropic does not have to worry about overutilization on a fixed-cost subscription plan, and I do not have to worry about subscription fatigue.

Thoughts on Perplexity Deep Research, after 24 hours of heavy use

Since its introduction on Feb 14, 2025, I have been using Perplexity's Deep Research – now the third major deep-research AI offering, after OpenAI ChatGPT's and Google Gemini's – rather extensively. It helps greatly with a research project I am working on.

An early finding: it often takes 5-20 minutes, not quite the 2-4 minutes advertised. Perhaps it is due to the spike in usage since the introduction, or perhaps it is a tweak to make it appear to work as hard as the other two Deep Research offerings. Overall, I do not have an issue with the response time, and I can see its thought process throughout the thinking time.

I am a subscriber of ChatGPT Plus ($20/month) and Perplexity Pro ($20/month). As such, I do not yet have access to ChatGPT Deep Research, which, as of today, is only available to ChatGPT Pro ($200/month) subscribers.

My preliminary findings on Perplexity Deep Research after about 24 hours of rather extensive use:

  1. As mentioned, it takes 5-20 minutes, hardly ever 2-4.

  2. It likes Reddit a lot. The cited sources tend to include Reddit heavily if you allow social sources. On the web version, I would often explicitly exclude social sources, but the mobile and desktop apps seem to lack such an option – I prefer including both web and academic sources but not social sources.

  3. Its responses seem rather brief (~1,000 words) compared to what I have seen from ChatGPT's Deep Research, and they often leave me wanting to learn more through follow-up questions. I am on the fence about whether this is better: a brief response can be consumed within 2-3 minutes, whereas I have seen some ChatGPT Deep Research users feed its very lengthy responses to Google's NotebookLM to help digest them.

  4. Perplexity goes into deep-research mode immediately on the first query, without first asking clarifying questions as ChatGPT Deep Research does. I do not like this as much, for the following two reasons:

    1. It forces the user to be highly verbose in prompting, because you do not get a chance to clarify after the LLM's first take. This takes a lot more effort and guesswork; if you miss something, you only get to correct yourself 15 minutes later.
    2. It may misinterpret your intention and give you output that is not accurate. One example of mine: I asked it to research the early history of the web, and it instead covered the ARPANET and the Internet broadly.
  5. It has no usage cap, which is good!

  6. UX issue: I like to see its chain of thought, but if you navigate away (often because you want to start a new thread while Perplexity works), you are unable to get back to an in-progress thread before the response is generated – which then hides the thought process.

  7. Bug: this is likely a teething issue, but I would lose threads or find the same queries being generated multiple times.

Overall, I am a happy user. I am looking forward to when ChatGPT's Deep Research is made available to Plus users. I now split my usage as follows:

  1. ChatGPT 4o/o1 for usual queries, quick research and planning tasks.
  2. ChatGPT o1/o3 or Claude for coding assistance.
  3. Perplexity Deep Research for... deep research with citations.

OpenAI on Open AI Strategy

Q: Would you consider releasing some model weights, and publishing some research?

Sam Altman: yes, we are discussing. i personally think we have been on the wrong side of history here and need to figure out a different open source strategy; not everyone at openai shares this view, and it's also not our current highest priority.

Source: AMA with OpenAI's team

It is interesting that Sam Altman said what he said, considering the ongoing legal battle between Elon Musk and OpenAI on a similar topic. It is even more intriguing that DeepSeek appears to be a catalyst for this shift in thinking.

Among some of my peers, there have been general extrapolations concerning the DeepSeek shockwave, suggesting that China is demonstrating to the world, particularly the U.S., how an open-source strategy can be effective. However, this conjecture seems far-fetched, as open-weight AI models not only originated in the U.S. but thrived there. The real catalyst for popularizing open-weight models was Meta's Llama, released in early 2023. Many of the open models available today are modeled on or structured after Meta's Llama.

Regardless, this goes beyond country lines and should not be politicized.

Open models are compelling but challenging to monetize. For Meta, this is manageable because it has substantial financial resources and revenue streams independent of its AI initiatives. This has given Meta ample room for experimentation, including the not-so-successful metaverse venture that has cost tens of billions of dollars.

For OpenAI, however, AI is its sole focus, making it difficult to convince its board to adopt an open approach. This is even more challenging considering that its clients' switching costs are negligible.

We have witnessed how open-source technology has enabled companies and researchers across national borders to build upon previous knowledge more easily, leading to better results and products in a remarkably short time frame.

For the sake of humanity, I sincerely hope that this trend persists.

One concern I have, however, is the increasing weaponization of chips, which could pose a significant barrier to innovation.

Switching cost for AI models is $0

It must have been a challenging week for the folks at OpenAI and other similar closed-source foundational AI model companies. The release of DeepSeek, which is not only better than OpenAI's o1 model but also free, has sent shockwaves through the industry. The fact that DeepSeek achieved this feat with only a fraction of the resources it took to train other comparable models is even more surprising.

Foundational closed-source AI companies usually make a loss on consumer products such as ChatGPT Plus and Claude Pro. It is commonly believed that these subscription-based consumer AI products do not bring in much profit; some plans are even loss-making, such as the $200-a-month ChatGPT Pro plan.

The foundational AI companies are still figuring out ways to turn profitable. The revenue model today relies largely on API and enterprise sales, i.e. the "AI wrappers" and existing apps racing to add AI features.

Sam Altman: The honest answer is we have no idea [on how to bring in revenue]. We have never made any revenue. We have no current plans to make revenue. We have no idea how we may one day generate revenue. We have made a soft promise to investors that once we've built this sort of generally intelligent system, basically we will ask it to figure out a way to generate an investment return for you. [The Journal by WSJ]

While we, fellow AI enthusiasts, are excited to see newer models outdoing one another every other day, the reality is that these models are all accessed through essentially the same style of API. This makes the switching cost for API users practically $0, with DeepSeek's API reportedly costing less than 5% of OpenAI's o1. The costs can be even lower if you run the open-source model on your own servers, reducing it to hosting costs alone. Memories, fine-tuning (LoRA and the like), and retrieval-augmented generation (RAG) applications can be ported to competing models with relative ease – in fact, more easily with open-source models than with their closed-source counterparts.
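As a sketch of how low that switching cost is: with OpenAI-compatible APIs, switching providers is often just a matter of changing the base URL and model name. The endpoint paths and model names below are assumptions for illustration, not a recommendation.

// Hypothetical sketch: the same chat-completion call against two providers.
const providers = {
  openai: { baseUrl: "https://api.openai.com/v1", model: "o1" },
  deepseek: { baseUrl: "https://api.deepseek.com/v1", model: "deepseek-reasoner" },
};

async function chat(providerName, prompt) {
  const { baseUrl, model } = providers[providerName];
  const res = await fetch(`${baseUrl}/chat/completions`, {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      // API key supplied via environment; never hard-code secrets.
      Authorization: `Bearer ${process.env.API_KEY}`,
    },
    body: JSON.stringify({ model, messages: [{ role: "user", content: prompt }] }),
  });
  const data = await res.json();
  return data.choices[0].message.content;
}

// Switching models is one argument away:
// await chat("openai", "Hello"); await chat("deepseek", "Hello");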

DeepSeek proposed a cut-price fee for accessing and using DeepSeek-R1, at 16 yuan ($2.20) per million tokens, considerably less than the 438 yuan ($60.00) for the same usage of OpenAI's o1. [Reuters]

It is going to be really challenging for foundational AI model companies to figure out a way to create stickiness or introduce switching costs, artificial or otherwise, the way cloud operators – especially email and office suite providers – have. A Google Workspace user will not switch just because a competitor offers email services at a 50% discount, even if the competitor provides 100% of the services available in Google Workspace. Technical and data migration pain is sticky, for better or for worse.

Self-publishing

A proud moment that started with a simple catch-up with a good friend.

I briefly shared that I am working on a project related to self-publishing. My friend Chandra shared that he had been considering blogging and asked me to set up a blog for him on the spot. He also shared that he would prefer self-publishing to contributing content on social media platforms. That was music to my ears!

I am on a mission to bring self-publishing back. More on that later.

Put on the spot, I thought about the best solution to enable self-publishing without over-complicating the writing process, while staying light on maintenance.

I thought of WordPress.com within a few seconds, for many reasons: a mature user-facing ecosystem, a straightforward UI, and low maintenance. Sure, I am aware of the mixed views surrounding Matt Mullenweg/Automattic/WordPress, but WordPress.com is possibly still the best low-maintenance self-publishing software out there today.

Chandra blogs at cmr.sg. Despite being newer than my site, cmr.sg already has a lot more posts. I have subscribed to it.

It is 2025, build like it's 2005!

Let’s Build Like It’s 2005

Software engineering has become overwhelmingly complex.

Want to build something today? Good luck. First, you’ll need to wade through an ocean of decisions: which tech stack to choose, how to set it up, which JS/CSS/TS/TSX frameworks to use, which build tools, linters, and testing environments to adopt, how to structure your project, organize your repository, and configure CI/CD pipelines. The list goes on. If it is something to be built by a product + engineering team, the complexity is often compounded even more, despite the extra headcount.

Even after spending days setting everything up, you’ll never quite feel satisfied. You’ll tinker, tweak, and second-guess every decision for weeks. You’ll obsess over improving your stack, striving for a perfection that always seems just out of reach.

Meanwhile, only 5% of your time goes into building your actual product. The other 95%? That’s spent on the stack.

It’s exhausting. It’s frustrating.

But it wasn’t always like this.

In 2005, we just build

Back in 2005, things were simpler. We didn’t have CI/CD pipelines, JSX, HTML5, or a hundred other acronyms to wrangle. We worked with what we had, and we got things done. If that meant coding in PHP, so be it. If that meant using Notepad to write code, so be it. If that meant aligning layouts with <table> tags and <br>s, so be it. And yes, debugging live wasn’t unheard of—as long as no one accidentally dropped the production database.

We mainly had one major problem to solve back then: Internet Explorer 6. We overcame it, and we laughed about it later.

Building should be fun

In 2005, building was fun. It wasn’t bogged down by layers of complexity. We focused on the joy of creating, on bringing our ideas to life.

It’s 2025 now, and I’m ready to reclaim that joy. I’m resolving to build like it’s 2005. No over-engineering. No unnecessary complexity. Just building.

So here’s to simple tools, straightforward solutions, and rediscovering the fun in creation.

It’s 2025. Let’s build like it’s 2005!

Hello again

Hello again.

I started blogging in 2003 on b2, the precursor to WordPress. But over time, the updates slowed down, and the blog eventually faded away.

Now, I’m diving back in, this time embracing a more minimal and self-sufficient setup with Zola.

Why Zola?

Zola is a static site generator, which means it creates a bundle of self-contained, portable files that can be easily hosted anywhere without the need for dynamic server-side processing. It’s fast, lightweight, and just works.

Zola is built in Rust. I’ve been learning Rust and loving it. While there’s no real need to modify Zola itself, the fact that it’s Rust-powered resonates with me.

Zola is Markdown-based. Writing in Markdown is second nature to me. It’s clean, simple, and distraction-free.

Zola uses Tera, a simple templating language. It reminds me of PHP – the good side of it, that is. It’s a refreshing change from the complexity of JSX.

I’m excited to return to the roots of blogging: self-publishing, owning my content, and keeping things simple.

Write more soon.