1010Computers | Computer Repair & IT Support

With a team that helped build the brain behind Alexa, HomeX raises $90M

HomeX, a home services platform for homeowners and service providers, has raised $90 million in a funding round led by New Mountain Capital.

New Mountain Capital, a New York-based investment firm with more than $30 billion in assets under management, was the only institutional investor to put money in this round alongside company executives. The company was bootstrapped until a 2019 $50 million-plus debt financing.

Founded in 2017, Chicago-based HomeX aims to “radically improve” home services by pairing service workers with homeowners, both virtually and in person. It also has built software, and offers services for, contractors that are aimed at helping them drive and manage demand “more efficiently.”

Notably, one of the company’s co-founders, CTO Simon Weaver, and several team members were on the development team of Evi, a startup that had built an AI program that can be communicated with using natural language via an app, that was acquired by Amazon in 2012. That technology was essentially the brain behind Amazon’s virtual assistant Alexa. 

HomeX uses artificial intelligence to diagnose home issues virtually before a contractor even goes out to a home, with the goal of helping them resolve a problem faster (by having the necessary equipment ahead of time for example), which in turn makes customers happier. 

“We’re using machine-generated content to create solutions that are specific to a homeowner’s issues,” said  co-founder and president Vincent Payen. “Using machines to understand symptoms, the questions to ask and to actually get to a diagnosis and a recommendation or resolution is where AI absolutely shines and allows us to do things that were not possible even three or five years ago.”

Founder and CEO Michael Werner worked in the $500 billion services industry for years (his family founded Werner Ladders) and recognized just how fragmented it was. He also acknowledges that, especially in certain markets, “there’s a terrible imbalance between very high demand and not enough contractors to do the work, or rather, a terrible labor shortage.”

HomeX Remote Assist in particular virtually connects homeowners (via phone, video or chat) with HomeX’s licensed technicians to diagnose and repair common home issues. That business unit has experienced more than 400% growth in less than a year, according to Werner. Last year, the company grew by “about 5x” the number of contractors on its platform. It declined to reveal revenue figures.

Image courtesy of HomeX

“For homeowners, we’re making home maintenance less complicated,” Werner said. “At the same time, we want to help the contractor succeed. Similar to how telemedicine has changed how medicine is delivered, HomeX Remote Assist is going to change the service experience for taking care of your home.”

Another area of HomeX’s business that is growing rapidly is its B2B offering. Home warranty and insurance companies see remote services “as very additive to make their business more efficient,” notes Payen.

“We are using some of our capital toward a pilot program and a number of business development opportunities there,” he said.

For now, while the company is not profitable overall, it is profitable in the services side of its business, according to Werner. In the last 12 months alone, it has served “hundreds of thousands” of clients via its platform, defined by unique virtual and physical appointments.  

New Mountain Capital Managing Director Harris Kealey said his firm viewed HomeX as a business that is primed to reshape the home and commercial services industry.

“The market is massive and the need for change and innovation is substantial,” he said in a written statement.

Another company in the space, Thumbtack, recently expanded into video home checkups. Thumbtack, a marketplace where you can hire local professionals for home improvement and other services such as repairs, in December acquired Setter, a startup which provided its customers with video home checkups conducted by experts, and then offered personalized plans for how to address any issues.

Thumbtack had laid off 250 employees at the end of March 2020, after the company saw big declines in its major markets. Since then, however, CEO Marco Zappacosta told TechCrunch there’s been “a renewed focus on the home and an acceleration of digital adoption.”

Powered by WPeMatico

SambaNova raises $676M at a $5.1B valuation to double down on cloud-based AI software for enterprises

Artificial intelligence technology holds a huge amount of promise for enterprises — as a tool to process and understand their data more efficiently; as a way to leapfrog into new kinds of services and products; and as a critical stepping stone into whatever the future might hold for their businesses. But the problem for many enterprises is that they are not tech businesses at their core, so bringing on and using AI will typically involve a lot of heavy lifting. Today, one of the startups building AI services is announcing a big round of funding to help bridge that gap.

SambaNova — a startup building AI hardware and integrated systems that run on it that only officially came out of three years in stealth last December — is announcing a huge round of funding today to take its business out into the world. The company has closed on $676 million in financing, a Series D that co-founder and CEO Rodrigo Liang has confirmed values the company at $5.1 billion.

The round is being led by SoftBank, which is making the investment via Vision Fund 2. Temasek and the government of Singapore Investment Corp. (GIC), both new investors, are also participating, along with previous backers BlackRock, Intel Capital, GV (formerly Google Ventures), Walden International and WRVI, among other unnamed investors. (Sidenote: BlackRock and Temasek separately kicked off an investment partnership yesterday, although it’s not clear if this falls into that remit.)

Co-founded by two Stanford professors, Kunle Olukotun and Chris Ré, and Liang, who had been an engineering executive at Oracle, SambaNova has been around since 2017 and has raised more than $1 billion to date — both to build out its AI-focused hardware, which it calls DataScale, and to build out the system that runs on it. (The “Samba” in the name is a reference to Liang’s Brazilian heritage, he said, but also the Latino music and dance that speaks of constant movement and shifting, not unlike the journey AI data regularly needs to take that makes it too complicated and too intensive to run on more traditional systems.)

SambaNova on one level competes for enterprise business against companies like Nvidia, Cerebras Systems and Graphcore — another startup in the space which earlier this year also raised a significant round. However, SambaNova has also taken a slightly different approach to the AI challenge.

In December, the startup launched Dataflow-as-a-Service as an on-demand, subscription-based way for enterprises to tap into SambaNova’s AI system, with the focus just on the applications that run on it, without needing to focus on maintaining those systems themselves. It’s the latter that SambaNova will be focusing on selling and delivering with this latest tranche of funding, Liang said.

SambaNova’s opportunity, Liang believes, lies in selling software-based AI systems to enterprises that are keen to adopt more AI into their business, but might lack the talent and other resources to do so if it requires running and maintaining large systems.

“The market right now has a lot of interest in AI. They are finding they have to transition to this way of competing, and it’s no longer acceptable not to be considering it,” said Liang in an interview.

The problem, he said, is that most AI companies “want to talk chips,” yet many would-be customers will lack the teams and appetite to essentially become technology companies to run those services. “Rather than you coming in and thinking about how to hire scientists and hire and then deploy an AI service, you can now subscribe, and bring in that technology overnight. We’re very proud that our technology is pushing the envelope on cases in the industry.”

To be clear, a company will still need data scientists, just not the same number, and specifically not the same number dedicating their time to maintaining systems, updating code and other more incremental work that comes managing an end-to-end process.

SambaNova has not disclosed many customers so far in the work that it has done — the two reference names it provided to me are both research labs, the Argonne National Laboratory and the Lawrence Livermore National Laboratory — but Liang noted some typical use cases.

One was in imaging, such as in the healthcare industry, where the company’s technology is being used to help train systems based on high-resolution imagery, along with other healthcare-related work. The coincidentally-named Corona supercomputer at the Livermore Lab (it was named after the 2014 lunar eclipse, not the dark cloud of a pandemic that we’re currently living through) is using SambaNova’s technology to help run calculations related to some COVID-19 therapeutic and antiviral compound research, Marshall Choy, the company’s VP of product, told me.

Another set of applications involves building systems around custom language models, for example in specific industries like finance, to process data quicker. And a third is in recommendation algorithms, something that appears in most digital services and frankly could always do to work a little better than it does today. I’m guessing that in the coming months it will release more information about where and who is using its technology.

Liang also would not comment on whether Google and Intel were specifically tapping SambaNova as a partner in their own AI services, but he didn’t rule out the prospect of partnering to go to market. Indeed, both have strong enterprise businesses that span well beyond technology companies, and so working with a third party that is helping to make even their own AI cores more accessible could be an interesting prospect, and SambaNova’s DataScale (and the Dataflow-as-a-Service system) both work using input from frameworks like PyTorch and TensorFlow, so there is a level of integration already there.

“We’re quite comfortable in collaborating with others in this space,” Liang said. “We think the market will be large and will start segmenting. The opportunity for us is in being able to take hold of some of the hardest problems in a much simpler way on their behalf. That is a very valuable proposition.”

The promise of creating a more accessible AI for businesses is one that has eluded quite a few companies to date, so the prospect of finally cracking that nut is one that appeals to investors.

“SambaNova has created a leading systems architecture that is flexible, efficient and scalable. This provides a holistic software and hardware solution for customers and alleviates the additional complexity driven by single technology component solutions,” said Deep Nishar, senior managing partner at SoftBank Investment Advisers, in a statement. “We are excited to partner with Rodrigo and the SambaNova team to support their mission of bringing advanced AI solutions to organizations globally.”

Powered by WPeMatico

Meroxa raises $15M Series A for its real-time data platform

Meroxa, a startup that makes it easier for businesses to build the data pipelines to power both their analytics and operational workflows, today announced that it has raised a $15 million Series A funding round led by Drive Capital. Existing investors Root, Amplify and Hustle Fund also participated in this round, which together with the company’s previously undisclosed $4.2 million seed round now brings total funding in the company to $19.2 million.

The promise of Meroxa is that businesses can use a single platform for their various data needs and won’t need a team of experts to build their infrastructure and then manage it. At its core, Meroxa provides a single software-as-a-service solution that connects relational databases to data warehouses and then helps businesses operationalize that data.

Image Credits: Meroxa

“The interesting thing is that we are focusing squarely on relational and NoSQL databases into data warehouse,” Meroxa co-founder and CEO DeVaris Brown told me. “Honestly, people come to us as a real-time FiveTran or real-time data warehouse sink. Because, you know, the industry has moved to this [extract, load, transform] format. But the beautiful part about us is, because we do change data capture, we get that granular data as it happens.” And businesses want this very granular data to be reflected inside of their data warehouses, Brown noted, but he also stressed that Meroxa can expose this stream of data as an API endpoint or point it to a Webhook.

The company is able to do this because its core architecture is somewhat different from other data pipeline and integration services that, at first glance, seem to offer a similar solution. Because of this, users can use the service to connect different tools to their data warehouse but also build real-time tools on top of these data streams.

Image Credits: Meroxa

“We aren’t a point-to-point solution,” Meroxa co-founder and CTO Ali Hamidi explained. “When you set up the connection, you aren’t taking data from Postgres and only putting it into Snowflake. What’s really happening is that it’s going into our intermediate stream. Once it’s in that stream, you can then start hanging off connectors and say, ‘Okay, well, I also want to peek into the stream, I want to transfer my data, I want to filter out some things, I want to put it into S3.’ ”

Because of this, users can use the service to connect different tools to their data warehouse but also build real-time tools to utilize the real-time data stream. With this flexibility, Hamidi noted, a lot of the company’s customers start with a pretty standard use case and then quickly expand into other areas as well.

Brown and Hamidi met during their time at Heroku, where Brown was a director of product management and Hamidi a lead software engineer. But while Heroku made it very easy for developers to publish their web apps, there wasn’t anything comparable in the highly fragmented database space. The team acknowledges that there are a lot of tools that aim to solve these data problems, but few of them focus on the user experience.

Image Credits: Meroxa

“When we talk to customers now, it’s still very much an unsolved problem,” Hamidi said. “It seems kind of insane to me that this is such a common thing and there is no ‘oh, of course you use this tool because it addresses all my problems.’ And so the angle that we’re taking is that we see user experience not as a nice-to-have, it’s really an enabler, it is something that enables a software engineer or someone who isn’t a data engineer with 10 years of experience in wrangling Kafka and Postgres and all these things. […] That’s a transformative kind of change.”

It’s worth noting that Meroxa uses a lot of open-source tools but the company has also committed to open-sourcing everything in its data plane as well. “This has multiple wins for us, but one of the biggest incentives is in terms of the customer, we’re really committed to having our agenda aligned. Because if we don’t do well, we don’t serve the customer. If we do a crappy job, they can just keep all of those components and run it themselves,” Hamidi explained.

Today, Meroxa, which the team founded in early 2020, has more than 24 employees (and is 100% remote). “I really think we’re building one of the most talented and most inclusive teams possible,” Brown told me. “Inclusion and diversity are very, very high on our radar. Our team is 50% black and brown. Over 40% are women. Our management team is 90% underrepresented. So not only are we building a great product, we’re building a great company, we’re building a great business.”  

Powered by WPeMatico

Zoho launches new low code workflow automation product

Workflow automation has been one of the key trends this year so far, and Zoho, a company known for its suite of affordable business tools has joined the parade with a new low code workflow product called Qntrl (pronounced control).

Zoho’s Rodrigo Vaca, who is in charge of Qntrl’s marketing says that most of the solutions we’ve been seeing are built for larger enterprise customers. Zoho is aiming for the mid-market with a product that requires less technical expertise than traditional business process management tools.

“We enable customers to design their workflows visually without the need for any particular kind of prior knowledge of business process management notation or any kind of that esoteric modeling or discipline,” Vaca told me.

While Vaca says, Qntrl could require some technical help to connect a workflow to more complex backend systems like CRM or ERP, it allows a less technical end user to drag and drop the components and then get help to finish the rest.

“We certainly expect that when you need to connect to NetSuite or SAP you’re going to need a developer. If nothing else, the IT guys are going to ask questions, and they will need to provide access,” Vaca said.

He believes this product is putting this kind of tooling in reach of companies that may have been left out of workflow automation for the most part, or which have been using spreadsheets or other tools to create crude workflows. With Qntrl, you drag and drop components, and then select each component and configure what happens before, during and after each step.

What’s more, Qntrl provides a central place for processing and understanding what’s happening within each workflow at any given time, and who is responsible for completing it.

We’ve seen bigger companies like Microsoft, SAP, ServiceNow and others offering this type of functionality over the last year as low code workflow automation has taken center stage in business.

This has become a more pronounced need during the pandemic when so many workers could not be in the office. It made moving work in a more automated workflow more imperative, and we have seen companies moving to add more of this kind of functionality as a result.

Brent Leary, principal analyst at CRM Essentials, says that Zoho is attempting to remove some the complexity from this kind of tool.

“It handles the security pieces to make sure the right people have access to the data and processes used in the workflows in the background, so regular users can drag and drop to build their flows and processes without having to worry about that stuff,” Leary told me.

Qntrl is available starting today starting at just $7 per user month.

Powered by WPeMatico

Vietnamese electric motorbike startup Dat Bike raises $2.6M led by Jungle Ventures

Son Nguyen, founder and chief executive officer of Dat Bike on one of the startup's motorbikes

Son Nguyen, founder and chief executive officer of Dat Bike

Dat Bike, a Vietnamese startup with ambitions to become the top electric motorbike company in Southeast Asia, has raised $2.6 million in pre-Series A funding led by Jungle Ventures. Made in Vietnam with mostly domestic parts, Dat Bike’s selling point is its ability to compete with gas motorbikes in terms of pricing and performance. Its new funding is the first time Jungle Ventures has invested in the mobility sector and included participation from Wavemaker Partners, Hustle Fund and iSeed Ventures.

Founder and chief executive officer Son Nguyen began learning how to build bikes from scrap parts while working as a software engineer in Silicon Valley. In 2018, he moved back to Vietnam and launched Dat Bike. More than 80% of households in Indonesia, Malaysia, Thailand and Vietnam own two-wheeled vehicles, but the majority are fueled by gas. Nguyen told TechCrunch that many people want to switch to electric motorbikes, but a major obstacle is performance.

Nguyen said that Dat Bike offers three times the performance (5 kW versus 1.5 kW) and 2 times the range (100 km versus 50 km) of most electric motorbikes in the market, at the same price point. The company’s flagship motorbike, called Weaver, was created to compete against gas motorbikes. It seats two people, which Nguyen noted is an important selling point in Southeast Asian countries, and has a 5000W motor that accelerates from 0 to 50 km per hour in three seconds. The Weaver can be fully charged at a standard electric outlet in about three hours, and reach up to 100 km on one charge (the motorbike’s next iteration will go up to 200 km on one charge).

Dat Bike’s opened its first physical store in Ho Chi Minh City last December. Nguyen said the company “has shipped a few hundred motorbikes so far and still have a backlog of orders.” He added that it saw a 35% month-over-month growth in new orders after the Ho Chi Minh City store opened.

At 39.9 million dong, or about $1,700 USD, Weaver’s pricing is also comparable to the median price of gas motorbikes. Dat Bike partners with banks and financial institutions to offer consumers twelve-month payment plans with no interest.

“These guys are competing with each other to put the emerging middle class of Vietnam on the digital financial market for the first time ever and as a result, we get a very favorable rate,” he said.

While Vietnam’s government hasn’t implemented subsidies for electric motorbikes yet, the Ministry of Transportation has proposed new regulations mandating electric infrastructure at parking lots and bike stations, which Nguyen said will increase the adoption of electric vehicles. Other Vietnamese companies making electric two-wheeled vehicles include VinFast and PEGA.

One of Dat Bike’s advantages is that its bikes are developed in house, with locally-sourced parts. Nguyen said the benefits of manufacturing in Vietnam, instead of sourcing from China and other countries, include streamlined logistics and a more efficient supply chain, since most of Dat Bike’s suppliers are also domestic.

“There are also huge tax advantages for being local, as import tax for bikes is 45% and for bike parts ranging from 15% to 30%,” said Nguyen. “Trade within Southeast Asia is tariff-free though, which means that we have a competitive advantage to expand to the region, compare to foreign imported bikes.”

Dat Bike plans to expand by building its supply chain in Southeast Asia over the next two to three years, with the help of investors like Jungle Ventures.

In a statement, Jungle Ventures founding partner Amit Anand said, “The $25 billion two-wheeler industry in Southeast Asia in particular is ripe for reaping benefits of new developments in electric vehicles and automation. We believe that Dat Bike will lead this charge and create a new benchmark not just in the region but potentially globally for what the next generation of two-wheeler electric vehicles will look and perform like.”

Powered by WPeMatico

Binance Labs leads $1.6M seed round in DeFi startup MOUND, the developer of Pancake Bunny

Decentralized finance startup MOUND, known for its yield farming aggregator Pancake Bunny, has raised $1.6 million in seed funding led by Binance Labs. Other participants included IDEO CoLab, SparkLabs Korea and Handshake co-founder Andrew Lee.

Built on Binance Smart Chain, a blockchain for developing high-performance DeFi apps, MOUND says Pancake Bunny now has more than 30,000 daily average users, and has accumulated more than $2.1 billion in total value locked (TVL) since its launch in December 2020.

The new funding will be used to expand Pancake Bunny and develop new products. MOUND recently launched Smart Vaults and plans to unveil Cross-Chain Collateralization in about a month, bringing the startup closer to its goal of covering a wide range of DeFi use cases, including farming, lending and swapping.

Smart Vaults are for farming single asset yields on leveraged lending products. It also automatically checks if the cost of leveraging may be more than anticipated returns and can actively lend assets for MOUND’s cross-chain farming.

Cross-Chain Collateralization is cross-chain yield farming that lets users keep original assets on their native blockchain instead of relying on a bridge token. The user’s original assets serve as collateral when the Bunny protocol borrows assets on the Binance Smart Chain for yield farming. This allows users to keep assets on native blockchains while giving them liquidity to generate returns on the Binance Smart Chain.

In a statement, Wei Zhou, Binance chief financial officer, and head of Binance Labs and M&A’s, said “Pancake Bunny’s growth and MOUND’s commitment to execution are impressive. Team MOUND’s expertise in live product design and service was a key factor in our decision to invest. We look forward to expanding the horizons of Defi together with MOUND.”

Powered by WPeMatico

Battery Resourcers raises $20M to commercialize its recycling-plus-manufacturing operations

Equipment at Battery Resourcers’ new cathode sintering and analysis facility in Novi, Michigan. (Photo: Battery Resourcers)

As a greater share of the transportation market becomes electrified, companies have started to grapple with how to dispose of the thousands of tons of used electric vehicle batteries that are expected to come off the roads by the end of the decade.

Battery Resourcers proposes a seemingly simple solution: recycle them. But the company doesn’t stop there. It’s engineered a “closed loop” process to turn that recycled material into nickel-manganese-cobalt cathodes to sell back to battery manufacturers. It is also developing a process to recover and purify graphite, a material used in anodes, to battery-grade.

Battery Resourcers’ business model has attracted another round of investor attention, this time with a $20 million Series B equity round led by Orbia Ventures, with injections from At One Ventures, TDK Ventures, TRUMPF Venture, Doral Energy-Tech Ventures and InMotion Ventures. Battery Resourcers CEO Mike O’Kronley declined to disclose the company’s new valuation.

The cathode and anode, along with the electrolyzer, are major components of battery architecture, and O’Kronley told TechCrunch it is this recycling-plus-manufacturing process that distinguishes the company from other recyclers.

“When we say that we’re on the verge of revolutionizing this industry, what we are doing is we are making the cathode active material — we’re not just recovering the metals that are in the battery, which a lot of other recyclers are doing,” he said. “We’re recovering those materials, and formulating brand new cathode active material, and also recovering and purifying the graphite active material. So those two active materials will be sold to a battery manufacturer and go right back into the new battery.”

“Other recycling companies, they’re focused on recovering just the metals that are in [batteries]: there’s copper, there’s aluminum, there’s nickel, there’s cobalt. They’re focused on recovering those metals and selling them back as commodities into whatever industry needs those metals,” he added. “And they may or may not go back into a battery.”

The company says its approach could reduce the battery industry’s reliance on mined metals — a reliance that’s only anticipated to grow in the coming decades. A study published last December found that demand for cobalt could increase by a factor of 17 and nickel by a factor of 28, depending on the size of EV uptake and advances in battery chemistries.

Thus far, the company’s been operating a demonstration-scale facility in Worcester, Massachusetts, and has expanded into a facility in Novi, Michigan, where it does analytical testing and material characterization. Between the two sites, the company can make around 15 tons of cathode materials a year. This latest funding round will help facilitate the development of a commercial-scale facility, which Battery Resourcers said in a statement will boost its capacity to process 10,000 tons of batteries per year, or batteries from around 20,000 EVs.

Another major piece of its proprietary recycling process is the ability to take in both old and new EV batteries, process them and formulate the newest kind of cathodes used in today’s batteries. “So they can take in 10-year-old batteries from a Chevy Volt and reformulate the metals to make the high-Ni cathode active materials in use today,” a company spokesman explained to TechCrunch.

Battery Resourcers is already receiving inquiries from automakers and consumer electronics companies, O’Kronley said, though he did not provide additional details. But InMotion Ventures, the venture capital arm of Jaguar Land Rover, said in a statement its participation in the round as a “significant investment.”

“[Battery Resourcers’] proprietary end-to-end recycling process supports Jaguar Land Rover’s journey to become a net zero carbon business by 2039,” InMotion managing director Sebastian Peck said.

Battery Resourcers was founded in 2015 after being spun out from Massachusetts’ Worcester Polytechnic Institute. The company has previously received support from the National Science Foundation and the U.S. Advanced Battery Consortium, a collaboration between General Motors, Ford Motor Company and Fiat Chrysler Automobiles.

Powered by WPeMatico

Atomico’s talent partners share 6 tips for early-stage people ops success

In the earliest stages of building a startup, it can be hard to justify focusing on anything other than creating a great product or service and meeting the needs of customers or users. However, there are still a number of surefire measures that any early-stage company can and should put in place to achieve “people ops” success as they begin scaling, according to venture capital firm Atomico‘s talent partners, Caro Chayot and Dan Hynes.

You need to recruit for what you need, but you also need to think about what is coming down the line.

As members of the VC’s operational support team, both work closely with companies in the Atomico portfolio to “find, develop and retain” the best employees in their respective fields, at various stages of the business. They’re operators at heart, and they bring a wealth of experience from time spent prior to entering VC.

Before joining Atomico, Chayot led the EMEA HR team at Twitter, where she helped scale the business from two to six markets and grew the team from 80 based in London to 500 across the region. Prior to that, she worked at Google in people ops for nine years.

Hynes was responsible for talent and staffing at well-known technology companies including Google, Cisco and Skype. At Google, he grew the EMEA team from 60 based in London to 8,500 across Europe by 2010, and at Skype, he led a talent team that scaled from 600 to 2,300 in three years.

Caro Chayot’s top 3 tips

1. Think about your long-term org design (18 months down the line) and hire back from there

When most founders think about hiring, they think about what they need now and the gaps that exist in their team at that moment. Dan and I help founders see things a little differently. You need to recruit for what you need, but you also need to think about what is coming down the line. What will your company look like in a year or 18 months? Functions and team sizes will depend on the sector — whether you are building a marketplace, a SaaS business or a consumer company. Founders also need to think about how the employees they hire now can develop over the next 18 months. If you hire people who are at the top of their game now, they won’t be able to grow into the employees you need in the future.

2. Spend time defining what your culture is. Use that for hiring and everything else people-related

If org design is the “what,” then culture is the “how.” It’s about laying down values and principles. It may sound fluffy, but capturing what it means to work at your company is key to hiring and retaining the best talent. You can use clearly articulated values at every stage of talent-building to shape your employer brand. What do you want potential employees to feel when they see your website? What do you want to look for in the interview process to make sure you are hiring people who are additive to the culture? How do you develop people and compensate them? These are all expressions of culture.

Powered by WPeMatico

Docugami’s new model for understanding documents cuts its teeth on NASA archives

You hear so much about data these days that you might forget that a huge amount of the world runs on documents: a veritable menagerie of heterogeneous files and formats holding enormous value yet incompatible with the new era of clean, structured databases. Docugami plans to change that with a system that intuitively understands any set of documents and intelligently indexes their contents — and NASA is already on board.

If Docugami’s product works as planned, anyone will be able to take piles of documents accumulated over the years and near-instantly convert them to the kind of data that’s actually useful to people.

If Docugami’s product works as planned, anyone will be able to take piles of documents accumulated over the years and near-instantly convert them to the kind of data that’s actually useful to people.

Because it turns out that running just about any business ends up producing a ton of documents. Contracts and briefs in legal work, leases and agreements in real estate, proposals and releases in marketing, medical charts, etc, etc. Not to mention the various formats: Word docs, PDFs, scans of paper printouts of PDFs exported from Word docs, and so on.

Over the last decade there’s been an effort to corral this problem, but movement has largely been on the organizational side: put all your documents in one place, share and edit them collaboratively. Understanding the document itself has pretty much been left to the people who handle them, and for good reason — understanding documents is hard!

Think of a rental contract. We humans understand when the renter is named as Jill Jackson, that later on, “the renter” also refers to that person. Furthermore, in any of a hundred other contracts, we understand that the renters in those documents are the same type of person or concept in the context of the document, but not the same actual person. These are surprisingly difficult concepts for machine learning and natural language understanding systems to grasp and apply. Yet if they could be mastered, an enormous amount of useful information could be extracted from the millions of documents squirreled away around the world.

What’s up, .docx?

Docugami founder Jean Paoli says they’ve cracked the problem wide open, and while it’s a major claim, he’s one of few people who could credibly make it. Paoli was a major figure at Microsoft for decades, and among other things helped create the XML format — you know all those files that end in x, like .docx and .xlsx? Paoli is at least partly to thank for them.

“Data and documents aren’t the same thing,” he told me. “There’s a thing you understand, called documents, and there’s something that computers understand, called data. Why are they not the same thing? So my first job [at Microsoft] was to create a format that can represent documents as data. I created XML with friends in the industry, and Bill accepted it.” (Yes, that Bill.)

The formats became ubiquitous, yet 20 years later the same problem persists, having grown in scale with the digitization of industry after industry. But for Paoli the solution is the same. At the core of XML was the idea that a document should be structured almost like a webpage: boxes within boxes, each clearly defined by metadata — a hierarchical model more easily understood by computers.

Illustration showing a document corresponding to pieces of another document.

Image Credits: Docugami

“A few years ago I drank the AI kool-aid, got the idea to transform documents into data. I needed an algorithm that navigates the hierarchical model, and they told me that the algorithm you want does not exist,” he explained. “The XML model, where every piece is inside another, and each has a different name to represent the data it contains — that has not been married to the AI model we have today. That’s just a fact. I hoped the AI people would go and jump on it, but it didn’t happen.” (“I was busy doing something else,” he added, to excuse himself.)

The lack of compatibility with this new model of computing shouldn’t come as a surprise — every emerging technology carries with it certain assumptions and limitations, and AI has focused on a few other, equally crucial areas like speech understanding and computer vision. The approach taken there doesn’t match the needs of systematically understanding a document.

“Many people think that documents are like cats. You train the AI to look for their eyes, for their tails … documents are not like cats,” he said.

It sounds obvious, but it’s a real limitation. Advanced AI methods like segmentation, scene understanding, multimodal context, and such are all a sort of hyperadvanced cat detection that has moved beyond cats to detect dogs, car types, facial expressions, locations, etc. Documents are too different from one another, or in other ways too similar, for these approaches to do much more than roughly categorize them.

As for language understanding, it’s good in some ways but not in the ways Paoli needed. “They’re working sort of at the English language level,” he said. “They look at the text but they disconnect it from the document where they found it. I love NLP people, half my team is NLP people — but NLP people don’t think about business processes. You need to mix them with XML people, people who understand computer vision, then you start looking at the document at a different level.”

Docugami in action

Illustration showing a person interacting with a digital document.

Image Credits: Docugami

Paoli’s goal couldn’t be reached by adapting existing tools (beyond mature primitives like optical character recognition), so he assembled his own private AI lab, where a multidisciplinary team has been tinkering away for about two years.

“We did core science, self-funded, in stealth mode, and we sent a bunch of patents to the patent office,” he said. “Then we went to see the VCs, and SignalFire basically volunteered to lead the seed round at $10 million.”

Coverage of the round didn’t really get into the actual experience of using Docugami, but Paoli walked me through the platform with some live documents. I wasn’t given access myself and the company wouldn’t provide screenshots or video, saying it is still working on the integrations and UI, so you’ll have to use your imagination … but if you picture pretty much any enterprise SaaS service, you’re 90% of the way there.

As the user, you upload any number of documents to Docugami, from a couple dozen to hundreds or thousands. These enter a machine understanding workflow that parses the documents, whether they’re scanned PDFs, Word files, or something else, into an XML-esque hierarchical organization unique to the contents.

“Say you’ve got 500 documents, we try to categorize it in document sets, these 30 look the same, those 20 look the same, those five together. We group them with a mix of hints coming from how the document looked, what it’s talking about, what we think people are using it for, etc.,” said Paoli. Other services might be able to tell the difference between a lease and an NDA, but documents are too diverse to slot into pre-trained ideas of categories and expect it to work out. Every set of documents is potentially unique, and so Docugami trains itself anew every time, even for a set of one. “Once we group them, we understand the overall structure and hierarchy of that particular set of documents, because that’s how documents become useful: together.”

Illustration showing a document being turned into a report and a spreadsheet.

Image Credits: Docugami

That doesn’t just mean it picks up on header text and creates an index, or lets you search for words. The data that is in the document, for example who is paying whom, how much and when, and under what conditions, all that becomes structured and editable within the context of similar documents. (It asks for a little input to double check what it has deduced.)

It can be a little hard to picture, but now just imagine that you want to put together a report on your company’s active loans. All you need to do is highlight the information that’s important to you in an example document — literally, you just click “Jane Roe” and “$20,000” and “five years” anywhere they occur — and then select the other documents you want to pull corresponding information from. A few seconds later you have an ordered spreadsheet with names, amounts, dates, anything you wanted out of that set of documents.

All this data is meant to be portable too, of course — there are integrations planned with various other common pipes and services in business, allowing for automatic reports, alerts if certain conditions are reached, automated creation of templates and standard documents (no more keeping an old one around with underscores where the principals go).

Remember, this is all half an hour after you uploaded them in the first place, no labeling or pre-processing or cleaning required. And the AI isn’t working from some preconceived notion or format of what a lease document looks like. It’s learned all it needs to know from the actual docs you uploaded — how they’re structured, where things like names and dates figure relative to one another, and so on. And it works across verticals and uses an interface anyone can figure out in a few minutes. Whether you’re in healthcare data entry or construction contract management, the tool should make sense.

The web interface where you ingest and create new documents is one of the main tools, while the other lives inside Word. There Docugami acts as a sort of assistant that’s fully aware of every other document of whatever type you’re in, so you can create new ones, fill in standard information, comply with regulations and so on.

Okay, so processing legal documents isn’t exactly the most exciting application of machine learning in the world. But I wouldn’t be writing this (at all, let alone at this length) if I didn’t think this was a big deal. This sort of deep understanding of document types can be found here and there among established industries with standard document types (such as police or medical reports), but have fun waiting until someone trains a bespoke model for your kayak rental service. But small businesses have just as much value locked up in documents as large enterprises — and they can’t afford to hire a team of data scientists. And even the big organizations can’t do it all manually.

NASA’s treasure trove

Image Credits: NASA

The problem is extremely difficult, yet to humans seems almost trivial. You or I could glance through 20 similar documents and a list of names and amounts easily, perhaps even in less time than it takes for Docugami to crawl them and train itself.

But AI, after all, is meant to imitate and transcend human capacity, and it’s one thing for an account manager to do monthly reports on 20 contracts — quite another to do a daily report on a thousand. Yet Docugami accomplishes the latter and former equally easily — which is where it fits into both the enterprise system, where scaling this kind of operation is crucial, and to NASA, which is buried under a backlog of documentation from which it hopes to glean clean data and insights.

If there’s one thing NASA’s got a lot of, it’s documents. Its reasonably well-maintained archives go back to its founding, and many important ones are available by various means — I’ve spent many a pleasant hour perusing its cache of historical documents.

But NASA isn’t looking for new insights into Apollo 11. Through its many past and present programs, solicitations, grant programs, budgets, and of course engineering projects, it generates a huge amount of documents — being, after all, very much a part of the federal bureaucracy. And as with any large organization with its paperwork spread over decades, NASA’s document stash represents untapped potential.

Expert opinions, research precursors, engineering solutions, and a dozen more categories of important information are sitting in files searchable perhaps by basic word matching but otherwise unstructured. Wouldn’t it be nice for someone at JPL to get it in their head to look at the evolution of nozzle design, and within a few minutes have a complete and current list of documents on that topic, organized by type, date, author and status? What about the patent advisor who needs to provide a NIAC grant recipient information on prior art — shouldn’t they be able to pull those old patents and applications up with more specificity than any with a given keyword?

The NASA SBIR grant, awarded last summer, isn’t for any specific work, like collecting all the documents of such and such a type from Johnson Space Center or something. It’s an exploratory or investigative agreement, as many of these grants are, and Docugami is working with NASA scientists on the best ways to apply the technology to their archives. (One of the best applications may be to the SBIR and other small business funding programs themselves.)

Another SBIR grant with the NSF differs in that, while at NASA the team is looking into better organizing tons of disparate types of documents with some overlapping information, at NSF they’re aiming to better identify “small data.” “We are looking at the tiny things, the tiny details,” said Paoli. “For instance, if you have a name, is it the lender or the borrower? The doctor or the patient name? When you read a patient record, penicillin is mentioned, is it prescribed or prohibited? If there’s a section called allergies and another called prescriptions, we can make that connection.”

“Maybe it’s because I’m French”

When I pointed out the rather small budgets involved with SBIR grants and how his company couldn’t possibly survive on these, he laughed.

“Oh, we’re not running on grants! This isn’t our business. For me, this is a way to work with scientists, with the best labs in the world,” he said, while noting many more grant projects were in the offing. “Science for me is a fuel. The business model is very simple — a service that you subscribe to, like Docusign or Dropbox.”

The company is only just now beginning its real business operations, having made a few connections with integration partners and testers. But over the next year it will expand its private beta and eventually open it up — though there’s no timeline on that just yet.

“We’re very young. A year ago we were like five, six people, now we went and got this $10 million seed round and boom,” said Paoli. But he’s certain that this is a business that will be not just lucrative but will represent an important change in how companies work.

“People love documents. Maybe it’s because I’m French,” he said, “but I think text and books and writing are critical — that’s just how humans work. We really think people can help machines think better, and machines can help people think better.”

Powered by WPeMatico

Microsoft goes all in on healthcare with $19.7B Nuance acquisition

When Microsoft announced it was acquiring Nuance Communications this morning for $19.7 billion, you could be excused for doing a Monday morning double take at the hefty price tag.

That’s surely a lot of money for a company on a $1.4 billion run rate, but Microsoft, which has already partnered with the speech-to-text market leader on several products over the last couple of years, saw a company firmly embedded in healthcare and decided to go all in.

And $20 billion is certainly all in, even for a company the size of Microsoft. But 2020 forced us to change the way we do business, from restaurants to retailers to doctors. In fact, the pandemic in particular changed the way we interact with our medical providers. We learned very quickly that you don’t have to drive to an office, wait in waiting room, then in an exam room, all to see the doctor for a few minutes.

Instead, we can get on the line, have a quick chat and be on our way. It won’t work for every condition, of course — there will always be times the physician needs to see you — but for many meetings such as reviewing test results or for talk therapy, telehealth could suffice.

Microsoft CEO Satya Nadella says that Nuance is at the center of this shift, especially with its use of cloud and artificial intelligence, and that’s why the company was willing to pay the amount it did to get it.

“AI is technology’s most important priority, and healthcare is its most urgent application. Together, with our partner ecosystem, we will put advanced AI solutions into the hands of professionals everywhere to drive better decision-making and create more meaningful connections, as we accelerate growth of Microsoft Cloud in Healthcare and Nuance,” Nadella said in a post announcing the deal.

Holger Mueller, an analyst at Constellation Research, says that may be so, but he believes that Microsoft missed the boat with Cortana and this is about helping the company catch up on a crucial technology. “Nuance will be not only give Microsoft technology help in regards to neural network-based speech recognition, but also a massive improvement from vertical capabilities, call center functionality and the MSFT IP position in speech,” he said.

Microsoft sees this deal doubling what was already a considerable total addressable market to nearly $500 billion. While TAMs always tend to run high, that is still a substantial number.

It also fits with Gartner data, which found that by 2022, 75% of healthcare organizations will have a formal cloud strategy in place. The AI component only adds to that number and Nuance brings 10,000 existing customers to Microsoft, including some of the biggest healthcare organizations in the world.

Brent Leary, founder and principal analyst at CRM Essentials, says the deal could provide Microsoft with a ton of health data to help feed the underlying machine learning models and make them more accurate over time.

“There is going be a ton of health data being captured by the interactions coming through telemedicine interactions, and this could create a whole new level of health intelligence,” Leary told me.

That of course could drive a lot of privacy concerns where health data is involved, and it will be up to Microsoft, which just experienced a major breach on its Exchange email server products last month, to assure the public that their sensitive health data is being protected.

Leary says that ensuring data privacy is going to be absolutely key to the success of the deal. “The potential this move has is pretty powerful, but it will only be realized if the data and insights that could come from it are protected and secure — not only protected from hackers but also from unethical use. Either could derail what could be a game-changing move,” he said.

Microsoft also seemed to recognize that when it wrote, “Nuance and Microsoft will deepen their existing commitments to the extended partner ecosystem, as well as the highest standards of data privacy, security and compliance.”

Kate Leggett, an analyst at Forrester Research, thinks healthcare could be just the first step and once Nuance is in the fold, it could go much deeper than that.

“However, the benefit of this acquisition does not stop [with healthcare]. Nuance also offers market-leading customer engagement technologies, with deep expertise and focus in verticals such as financial services. As MSFT evolves their industry editions into other verticals, this acquisition will pay off for other industries. MSFT may also choose to fill in the gaps within their Dynamics solution with Nuance’s customer engagement technologies,” Leggett said.

We are clearly on the edge of a sea change when it comes to how we interact with our medical providers in the future. COVID pushed medicine deeper into the digital realm in 2020 out of simple necessity. It wasn’t safe to go into the office unless absolutely necessary.

The Nuance acquisition, which is expected to close some time later this year, could help Microsoft shift deeper into the market. It could even bring Teams into it as a meeting tool, but it’s all going to depend on the trust level people have with this approach, and it will be up to the company to make sure that both healthcare providers and the people they serve have that.

Powered by WPeMatico