paint-brush
Making AI Work for Your Startup: Essential Dos and Don'tsby@yuridvoinos
299 reads

Making AI Work for Your Startup: Essential Dos and Don'ts

by Yuri DvoinosApril 5th, 2024
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

This article delves into the essential considerations for startups venturing into AI development, emphasizing the importance of assessing AI's impact on use cases, building a skilled team, and acquiring high-quality data sets. It highlights the challenges and strategic insights necessary for successful AI integration in startup environments.
featured image - Making AI Work for Your Startup: Essential Dos and Don'ts
Yuri Dvoinos HackerNoon profile picture

In the dynamic world of startups, AI technologies have reached a pivotal juncture, unlocking the potential for true breakthroughs across various industries. With the AI landscape ripe for disruption, startups are eagerly exploring innovative use cases where AI can unleash a tenfold increase in productivity. As a seasoned entrepreneur deeply entrenched in AI, I've mentored numerous startups through this maze, witnessing common missteps early in the game. A frequent illusion harbored by founders is the 'quick-fix' fallacy, believing a plug-and-play approach with AI can lead to swift profits.


Before embarking on the journey of AI development, founders should reflect on these three crucial questions:


Is AI Truly a Game-Changer for Your Use Case?

Reflecting on my journey, I recall when my team chose an unconventional path while developing a computer game called “Reel Valley”, opting for nim-lang, an offbeat programming language. This choice, although innovative, diverted us away from essential aspects like core gameplay development and market fit. Our ambitious tech exploration led to delayed and flawed releases. A lesson learned: it’s often more pragmatic to use standard tools like Unity, concentrating on what truly matters – the product itself.


This scenario echoes a common conundrum faced by many startup founders. The allure of integrating AI, largely due to its tech glamor can be tempting. However, the key is to stay devoutly aligned with your customer needs rather than getting swept up in technological hype.


For instance, imagine a startup that aims to simplify online shopping experiences. While an AI-powered recommendation engine might seem like a futuristic addition, a basic heuristic algorithm based on customer purchase history and preferences could initially suffice and effectively meet customer needs. It’s imperative for founders to critically assess: Does AI offer a real tenfold improvement for our specific use case, or are we chasing it for the sake of being on the tech forefront?


Do You Have a Team with Relevant Experience?

Delving into the AI landscape reveals a mosaic of specialized technologies. Often bundled under the 'AI' tag are distinct domains like Large Language Models (LLM), Machine Learning (ML), Natural Language Processing (NLP) and others. Each field is vast and complex enough to require dedicated expertise. Thus, when building an AI solution, it's crucial to have a team member adept in the specific technology relevant to your project.


Even for a team of experienced Data Scientists, crafting effective AI models can pose significant challenges. At Aura, for example, our goal was to develop a Machine Learning model for SMS protection, filtering out spam and scams from messages sent by unknown numbers. The task, straightforward with clear-cut spam, became intricate when it came to detecting scams, often crafted to mimic messages from connections. Imagine the difficulty in discerning genuine messages from deceptive ones like, 'Hi, I miss you. How are you?' This complexity was a formidable challenge, even for Data Scientists boasting a decade of experience.


Beyond the creation of the model, the proficiency of Data Scientists in communicating and guiding the broader team on evaluating the model's performance is invaluable. We utilized the “Confusion Matrix” methodology to measure accuracy. This approach involves an initial analysis of a few hundred messages by the model, followed by our experts scoring each message as 'scam,' 'not scam,' or 'unsure.' Comparing these human-assessed scores against the model's outputs helped us identify false negatives (actual scam messages missed by the model) and false positives (legitimate messages incorrectly flagged as spam). The process of training the model—reprocessing messages, tweaking probabilities, and modifying algorithms—spanned over six months from test to production. But this wasn’t a one-off task, as scam techniques evolve, so does our model, demanding ongoing refinement and recalibration.


Do You Have a Large High-Quality Data Set?

Embarking on AI development in startups, such as my experience at FigLeaf, centers around a critical challenge: ensuring the quality of data. FigLeaf aimed to empower users to maintain their privacy online, particularly in registering on websites anonymously with masked emails. We sought to create a Machine Learning model to differentiate registration forms from login forms, a task complicated by the lack of standardization in form designs across the web.


Initially, we thought the solution was straightforward. However, the first versions of our model suffered from performance issues, primarily due to inadequate training data. We had our Customer Support team manually label hundreds of web forms every day, believing that several thousand forms a month would be sufficient. But reality quickly set in—this amount was just a drop in the ocean, barely scratching the surface of what was needed for the model to function effectively across the diverse landscape of the internet.


Adding to the complexity, websites often employ A/B testing for their signup pages or maintain different versions for distinct audience segments. This variability meant we were not just dealing with a vast number of forms, but also with forms that were constantly changing or appearing in different versions. It highlighted that in Machine Learning, 'a substantial amount' of data means far more than one might initially anticipate.

Ultimately, the road to AI success in startups is paved with meticulous planning and patience. Crafting innovative AI technology is a marathon, not a sprint, where shortcuts can be deceptive detours. A grounded, realistic approach to development timelines is crucial. It's through this committed and thoughtful process that your AI initiative will evolve into a groundbreaking, market-leading force.