Rise of the machines (decision making when milliseconds matter)


One of the benefits of being part of a seed-stage fund is that we often get to see the next big trends before they become widely known.  For example, Josh has a widely known thesis on data-exhaust and the implicit web where he identifies the value of the data trail we all leave behind. This thesis supported multiple investments including our investment in Mint.

In the 200 milliseconds it takes you to draw your gun, my algorithmic gun slinger could make 39 stock trades and shoot you in the heart

I believe we are now seeing a change in data processing and decision-making that will be equally significant for investors and entrepreneurs.

Historically, data analysis and computation was done on log files and stored data pools.  In these types of businesses, the data decisioning was done “out of band” — or not in real-time.  However, we’ve now started to see a whole series of applications and businesses where data analysis and decisioning is happening “in band” on streams of data.  In these applications, milliseconds matter.  Massive data analysis and computation are being performed in real-time — and the user’s experience is affected by this analysis.

This is a fundamental change.  Humans do not operate in milliseconds. For the real time web to function, the human decisions have to occur before the clock starts. We need to focus on predictive analysis and algorithms that make rule based decisions for us informed by the data stream.

The Facebook Newsfeed (already has machine intervention) and the Twitter Stream are the most frequently noted “data streams”, but the auto-generated data created by every consumer action, ad impression and click are orders of magnitude larger.  Including the data streams generated by CDNs and ISP’s you can see the exponential nature of the decision requirements in this new streaming data world.

The first vertical to move to real-time is advertising (Spark Cpaital’s Mo Koyfman has a nice summary of the shift in on-line advertising on his blog). It is not surprising that advertising is the first industry to move in this direction. It is most similar to the financial markets and over the past ten years the percentage of equity trades on US exchanges driven by algorithms has grown to over 70%. This was the insight behind Appnexus and Invite Media and may explain why some of the first guys to envision a transparent market for display inventory and real-time bidding came out of a finance school.

In our portfolio I see the power of operating in stream. Ad insertion order compliance can now happen in real-time based on contextual data streams analyzed by Double Verify.  VigLink can identify un-affiliated links across the web and not only append the link with an affiliate code, but choose the profit maximizing link in each instance, in real-time. Knewton’s testing platform is able to provide each student with a customized and personalized test-prep experience, based on their real-time adaptive education platform.  Aggregate Knowledge automatically produces personalized and dynamic creative for advertisements by using real-time algorithms.  In milliseconds Monetate applies a specific set of merchandising rules to individual consumer data streams. The result is a unique shopping experience for each visitor to an e-commerce site.

As we move from a world of data pools to data streams and processing power is distributed to the edge, what other changes will take place now that milliseconds matter? Will infrastructure changes take place as well? Will the real-time web increase the value of a millisecond enough to force companies to co-locate their algorithms at the CDN site or even the end-point device level?

I would love to discuss it in the comments, @phineasb or phin@firstround.com

, ,

  • Pingback: uberVU - social comments

  • http://www.funnelscope.com/ kulsingh

    Phin, fantastic post! I agree with your thoughts above and think we will soon move from wisdom of the crowd to wisdom before the crowd (as we have seen in other verticals like financial services via algo trading). The potential applications that can built on this framework will be compelling and exiting. I am working on a new venture (Tickreel) that is working on this infrastructure and is days from a prototype. Our platform hits on many of the themes mentioned above. Great to hear FRC provide insights on their vision, especially since it aligns well with ours. I'll reach out to Josh and Howard when we are ready.

  • http://www.separatepiece.com phineasb

    Sounds good. Look forward to learning more about what you are up to when the time is right

  • eugmandel

    Excellent post! “Humans do not operate in milliseconds”. Also, humans do not operate on the vast amount of data created by the real time web. This does call for machines to be in the stream and make decisions by themselves.
    Another class of applications needed is machines processing the stream and making it palatable for human consumption.

  • http://www.separatepiece.com phineasb

    Yes. Filters will be needed. I wonder if the best filters will be machines (search and recommendation engines) or humans (social recommendations)
    Phineas Barnes
    Principal, First Round Capital
    215.327.0849
    phin@firstround.com
    @phineasb

  • Pingback: Most Tweeted Articles by Venture Capital Experts: MrTweet

  • eugmandel

    Alex and I are trying a hybrid approach. Remember we talked about our Twitter ad network technology? We stopped doing it (long story :) ), but the tech we developed came very handy for our new project.

  • http://www.separatepiece.com phineasb

    I remember the original idea and look forward to seeing where the building blocks of that project lead you in the new one
    Phineas Barnes
    Principal, First Round Capital
    215.327.0849
    phin@firstround.com
    @phineasb

  • http://www.mustexist.com/ Alex Sherstinsky

    I agree that the trend that Phineas is describing has the potential to change the data processing stack through its many layers, including the underlying infrastructure. In order for this change to happen, there needs to be a sufficiently broad class of applications that drive and focus these developments around markets. Everyone knows that, which is why in most cases we try to err on the side of developing a minimally viable product and get customer validation. That being said, a lot of us — at least Eugene and I — are hoping to get an opportunity — expressed by customer demand — to develop a powerful technology like that.

  • http://www.twitter.com/famolari famolari

    Great post and important topic. Real time processing and decision-making will certainly find application beyond fintech and adtech, though those areas have clear and immediate financial implications and thus will drive lots of early innovation.

    Milliseconds definitely matter in high frequency trading, which is why you see many firms trying to compete on physics by co-locating. But that’s not inherently alpha-generating and advantages will be arb’d or regulated away (NASDAQ said they may start mandating the length of networking cables, for instance.)

    Real-time could do some very cool stuff in gaming as well. Imagine the kick-ass, augmented reality scenarios people could come up with if they could interpret massive amounts of visual, online and bio data all in real-time.

    Beyond all the fascinating mining of social graphs and data exhaust that FRC’s been talking about, there are many other verticals, many not consumer facing, that are being enhanced by closing the decision-making and real-time information processing loop. From health care (vitals monitoring, sensor fusion) to cyber security and terrorism prevention (anomaly & intrusion detection, pattern analysis) to energy (smart grid demand-response, dynamic pricing, load balancing), the applications are endless. In all these areas, collecting, monitoring and interpreting massive amounts of real-time data in-line adds tremendous value.

    It’s a good time to be in algorithms, information processing and high-performance computing. I expect to see lots of new companies and investment focus aimed at leveraging the above to serve specific business needs. Roger Ehrenberg’s new “big data” fund is an early and excellent example.

  • http://www.separatepiece.com phineasb

    Great points on the other areas that will be impacted. Thanks for adding to this post with the insightful comment. Also agree on your point about Rodger's focus and think he will not be the last to zero in on this area.

  • http://twitter.com/manuelmartin Manuel Martin

    Hi Phin,

    Thank you for this thought provoking post. It seems like the success of the real time web largely depends on the success of the intelligent web. As data volumes increase, humans and applications will need help filtering out the noise to focus on what matters most for a particular task or decision. Some could argue that only operational decisions (e.g. credit approvals) can benefit from real time predictive analytics, I believe strategic decisions (i.e. should we invest in this company) will still need human intervention but the outcome can be improved by having the right information sooner. Real time (or near real time) predictive analytics could seem like a no brainer but not all industries will benefit the same. In addition to the number of decisions, a key adoption driver will be the cost of making a wrong decision b/c of not having the right insight at the right time.

    Cheers,
    -Manuel
    http://www.in2clouds.com

  • http://www.sneakerheadVC.com/ phineasb

    I agree with you and especially like your thoughts on how adoption will be driven by the cost of making a bad decision. It would be interesting to map out an adoption chart based on an assessment of the costs of being wrong.

    Could this be the thing that drives medical professionals to finally give up pen and paper data entry?
    Phineas Barnes
    Principal, First Round Capital
    phin@firstround.com
    http://www.sneakerheadVC.com
    @phineasb

  • Steve

    Phin,

    Thanks for your well written, thought provoking post. You are certainly correct in your assessment that the existing models between advertiser and publisher are not working. I for one believe that it’s not just an issue of mis alignment, but is even more fundamental and structurally flawed. The current models are predicated on a mass market, mass media system, where audiences where acquired via monopolistic (economic, legal, technical) providers and there was an absolute scarcity of inventory which which could be sold at maximum value, at prices validated by GIA /AIA equivalent appraisers like Nielsen and Arbitron.

    But now, it’s changing very quickly to a conversational model for which a good measurement really hasn’t evolved just yet. To a degree CPC is the best thing going for now, but I think to your point, new measures of effectiveness and subsequent value need to be created.

    I would go a step further and say it’s not even a system that is broke, since it never was working in the first place. Sure looks like a wide open area for new thinking and innovation.

  • http://twitter.com/gulpthis Gulp Media

    This argument seems to me to be so over thought.

    Artists create content. Artists gain audiences. Marketers need reach. So, marketers buy audience, thereby sponsoring the artists.

    If the artist knows enough about every single person in their audience, they can sell “targeted reach” maximizing the full value of their audience.

    The first question I get is “How can you collect that data?”

    Anyone ever think to just ask? I did.

  • http://waxingphilosophic.com josh guttman

    Congrat’s on the move Phin. Psyched to have you guys in NYC.

    I predict your inbox will suffer.

  • http://spencerfry.com/ Spencer Fry

    F yeah!

  • http://www.christinacacioppo.com Christina Cacioppo

    Congrats! Makes me that much more excited for the next dinner.

  • http://www.adventurista.com/ Sarah Tavel

    Congrats, Phin! Excited you’ll finally be making the move.

  • http://innonate.com/ innonate

    We’re glad to have you!

  • http://twitter.com/MikeDuda MikeDuda

    Welcome to the jungle. And psyched you’re here.

  • http://www.brianrothenberg.com/ Brian Rothenberg

    Congrats on pulling the trigger and making the move. Looking forward to seeing you around more than our usual once a month!

  • http://justin-singer.com/ Justin Singer

    Looking forward to having you around man.

  • MK

    see you Monday… in Philly! ha! Enjoy the Amtrak down :)