Successful AI Companies Build Insurmountable Leads Using Data Strategy

Blue River Technology's “see and spray” tech at work on a crop sprayer.

The most recent issue of MIT Technology Review presents its annual list of 35 Innovators Under 35.  Of these, 15 (43%) are AI-based.  Another 3 are in computational synthetic biology, which depends on deep learning.

Similarly, the website which tracks the formation of and investment in startups shows about 6,800 companies specifically relating to AI.  That’s probably understated.  I’d round up to an even 10,000.

So it’s no surprise that AI is the siren song that launched 10,000 ships.  The real question is how many will survive for even the next three years?

We’re not talking about how existing companies should capitalize on AI to enhance their business.  We’re talking about how to become the next Google, Facebook, or Amazon with a lead so dominant that no one can catch up.

The Single Key Strategy that Defines AI Success: Data Dominance

Start to look at individual companies and you’ll see that they are focused on their technology, the user experience, and their product or platform.  This perspective will take them no further than being just another product or perhaps only a feature.  It will not make them a viable long-term company that returns its investors’ capital, much less the desired multiple.

To create a successful AI company you must create such a wide moat that no one can catch up unless they pay your price.  That moat is not about technology.  There are essentially no monopolies on deep learning technologies, only leaders that can quickly be copied.

The secret to a wide moat in AI is to have a virtual monopoly on the data you are using to train.  In this case monopoly also means such a large lead in users and data volume that no one can reasonably catch up.

How to Create a Data Monopoly

All AI companies face the same barrier when starting out:  how to obtain enough data to train their product.

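Everyone recognizes the virtuous feedback cycle: more data makes a better product, a better product attracts more users, and more users generate more data.  But without users you can’t generate sufficient data, and without data you can’t build the product that attracts users, and so it continues.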
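To make the cold-start problem concrete, here is a toy simulation of that feedback cycle.  It is only a sketch: the function name, the parameters, and every number in it are hypothetical and are not drawn from any real company.  It assumes product quality rises with accumulated data and that user growth rises with quality.

```python
def simulate_data_flywheel(
    initial_users: float,
    rounds: int,
    data_per_user: float = 10.0,          # hypothetical data points per user per round
    half_quality_data: float = 100_000.0,  # hypothetical data volume giving quality 0.5
    max_growth: float = 0.5,               # hypothetical best-case user growth per round
) -> list[float]:
    """Return the user count after each round of a toy data feedback loop."""
    users = initial_users
    data = 0.0
    history = []
    for _ in range(rounds):
        # Users generate training data.
        data += users * data_per_user
        # More data means a better product, with diminishing returns (0 <= quality < 1).
        quality = data / (data + half_quality_data)
        # A better product attracts users faster.
        users *= 1.0 + max_growth * quality
        history.append(users)
    return history


# Two companies with identical technology but different starting user bases.
leader = simulate_data_flywheel(initial_users=1_000, rounds=10)
follower = simulate_data_flywheel(initial_users=10, rounds=10)
print(f"user ratio at start: {1_000 / 10:.0f}x")
print(f"user ratio after 10 rounds: {leader[-1] / follower[-1]:.0f}x")
```

Under these assumptions the company that starts with more users pulls further ahead every round, even though both run exactly the same model.  That is the sense in which the moat is the data rather than the technology.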

The question they should be asking, even before taking investment, is how the data can be acquired in a way that is strategically defensible.  The answer to this question will eliminate many markets and applications where data is not defensible or competitors already have substantial leads.

For example, there’s no wide moat available in advertising.  Google dominates search-based advertising and Facebook dominates social-media-based advertising.  General e-commerce?  You can’t beat the lead that Amazon has in learning about our personal shopping desires.  These three industry giants clearly have defensible positions by virtue of their dominant data.

So How Then to Identify and Collect Defensible Data

A defensible data strategy is not something you can sprinkle on any AI startup.  It starts by carefully selecting the industry and the problem to be solved.  These are not easy to find, but here are some examples to get your thought processes started.

You’ll find here a unique blend of identifying markets and market needs where the addition of AI creates opportunity.  You’ll also see examples of creating new types of data in existing markets that competitors can’t duplicate.

Here are a few selected examples that exemplify good data strategies:

Blue River Technology:  This is a company that offers agricultural optimization by evaluating each plant individually at each stage of growth.  There are plenty of competitors that use drones or stationary sensors to divide a field into smaller segments to be optimized but no competitor that does this on a plant-by-plant basis.

Their technology platform looks like 30-foot-wide arms on the front of a tractor that take an image of each plant (think lettuce, for example) as the arms pass over.  Based on its AI model, the platform makes an instantaneous decision to provide water or fertilizer, or to apply an herbicide.  There is no sense putting energy into a plant that’s not going to make it or that is a weed.  Blue River calls this ‘see and spray’.
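The per-plant decision described above can be sketched in a few lines.  This is a hypothetical illustration, not Blue River’s actual system: the PlantPrediction fields, the viability threshold, and the order of the rules are all assumptions made for the example, and the image model that would produce the prediction is left out entirely.

```python
from dataclasses import dataclass
from enum import Enum


class Action(Enum):
    WATER = "water"
    FERTILIZE = "fertilize"
    HERBICIDE = "herbicide"
    SKIP = "skip"


@dataclass
class PlantPrediction:
    """Hypothetical output of an image model for a single plant."""
    is_weed: bool
    viability: float        # 0.0 (will not make it) to 1.0 (healthy)
    needs_water: bool
    needs_fertilizer: bool


def decide(pred: PlantPrediction, min_viability: float = 0.3) -> Action:
    """Choose one action for one plant as the arm passes over it.

    The threshold and rule order are assumptions for illustration only.
    """
    if pred.is_weed:
        return Action.HERBICIDE   # weeds get herbicide, never inputs
    if pred.viability < min_viability:
        return Action.SKIP        # no sense investing in a plant that won't make it
    if pred.needs_water:
        return Action.WATER
    if pred.needs_fertilizer:
        return Action.FERTILIZE
    return Action.SKIP            # healthy plant, nothing to do


# Example: a struggling but viable lettuce plant that is short on water.
print(decide(PlantPrediction(is_weed=False, viability=0.6,
                             needs_water=True, needs_fertilizer=False)))
```

The rules themselves are trivial.  What a competitor cannot easily copy is the labeled plant-image data needed to train a model that fills in those prediction fields accurately.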

The process of getting the training data wasn’t simple.  It involved a significant investment in running their prototype platform over farm fields to acquire images of individual plants, which were then coded for health, sickness, and optimum use of fertilizer and water.  They now have the world’s largest database of plant images, which continues to grow with each pass of their equipment over a field.  Their lead in plant-level AI image training data is unassailable.

[Editor’s Note: Deere & Company acquired Blue River Technology in September 2017 for $305 million.]

Read the source article in Data Science Central.