The Ultimate Guide To Data Analysis


Data analysis comes at the tail end of the data lifecycle. Directly after or simultaneously performed with data integration (in which data from different sources are pulled into a unified view). Data analysis involves cleaning, modelling, inspecting and visualizing data.

The ultimate goal of data analysis is to provide useful data-driven insights for guiding organizational decisions. And without data analysis, you might as well not even collect data in the first place. Data analysis is the process of turning data into information, insight, or hopefully knowledge of a given domain.
(more…)

Read More

Converting text documents into knowledge graphs with the Diffbot Natural Language API

Most of the world’s knowledge is encoded in natural language (e.g., news articles, books, emails, academic papers). It is estimated that 80 percent of business-relevant information originates in unstructured form, primarily text. However, the ambiguous nature of human communication makes it difficult for software engineers and data scientists to leverage this information in their applications.

After years of research, we are proud to announce the Diffbot Natural Language API, a new product to help businesses convert their text documents into knowledge graphs. Knowledge graphs represent information about real-world entities (e.g., people, organizations, products, articles) via their relationships with other entities (e.g., founded by, educated at, was mentioned in). This is the same production-grade technology that we use to build the world’s largest knowledge graph from the web, and we are making it available to all.

(more…)

Read More

Is RPA Tech Becoming Outdated? Process Bots vs Search Bots in 2020

The original robots who caught my attention had physical human characteristics, or at least a physically visible presence in three dimensions: C3PO and R2D2 form the perfect duo, one modeled to walk and talk like a bookish human, the other with metallic, baby-like cuteness and it’s own language. 

Both were imagined, but still very tangible. And this imagery held staying power. This is how most of us still think about robots today. Follow the definition of robot and the following phrase surface, “a machine which resembles a human.” A phrase only followed by a description of the types of actions they actually undertake. 

Most robots today aren’t in the places we’d think to look based on sci-fi stories or dictionary definitions. Most robots come in two types: they’re sidekicks for desktop and server activities at work, or robots that scour the internet to tag and index web content.

All-in-all robots are typically still digital. Put another way, digital robots have come of age much faster than their mechanical cousins. 

(more…)

Read More

Returning To Work After Childbirth And During A Global Pandemic

In this post, we interview Jing Hao, a Software Engineer on the Solutions team. She’s one of the newest mom’s at Diffbot.

 

Hi Jing! Congratulations on the new addition to your family and welcome back to work – we’ve missed you and can’t wait to hear all about your journey into motherhood during a time of uncertainty. I think it’s a good idea to help parents-to-be understand how to navigate the current environment.
(more…)

Read More

Welcome Huzail Ssemakula – Technical Recruiter

Hello everyone,

My name is Huzail Ssemakula (pronounced who-zail semakula). I will be joining Diffbot as a technical recruiter on our amazing recruiting team.

My recruiting journey started at Amazon where I helped grow several teams such as Amazon Prime and the Alexa technologies. Since then, my career took me to Google and Cruise Automation where I focused on working with teams solving Machine Learning problems. I am very impressed with what Diffbot has achieved with such a small group of talented people. I look forward to helping the company achieve its mission, by bringing in more talented individuals to the Diffbot family.

On a personal note, I enjoy spending time with my family doing outside activities. We live in the Seattle area surrounded by nature and parks. I’m also huge on sports, soccer/futbol being my number one. Fun facts; I’m left handed, born and raised in Uganda, and I can speak several languages.

I can’t wait to meet you all in person once the world is back in order 🙂

Read More

Welcome Ariadne Caldwell – Executive Assistant to Diffbot’s CEO and Founder, Mike Tung

Hi everyone, I’m Ariadne Caldwell. Recently, I joined Diffbot as the Executive Assistant to CEO, Mike Tung. For the past five years, I have supported C-Level and high profile Executives across industries such as SaaS, Real Estate and Food & Hospitality. I’m passionate and enthusiastic in helping support teams who solve complex problems with industry leading solutions.

I love working on special projects and company initiatives. In my previous roles I have led social media strategy, creation and execution of a podcast, managed recruitment processes, edited and produced videos, designed brand collateral, and other tasks that go outside of the typical Executive Assistant scope of work. 

My goals are to provide proactive and strategic administrative support across the organization. I believe relationship building is key to forming an inclusive and welcoming company culture.

Born and raised in the Bay Area, I am a San Francisco State University graduate with a Bachelor of Science in Business Administration – International Business. 

I enjoy traveling, scuba diving, writing, reading, and spending time with my family. 

Very excited to be a new member of the Diffbot team! 

Read More

Welcome Ondrej Pacovsky – Machine Learning Engineer

Hi there! I am Ondřej Pacovský, from the mighty Czech Republic. I had just started as a Senior Machine Learning Engineer in the research group.

A little bit about myself – I made my first cash writing software when I was 14 and decided to focus my endeavors making computers smarter, that is, artificial intelligence. After graduating from Charles University and Sussex University, I started working in game development as an AI expert and also a lead developer on the side. I then joined Google, and worked on various machine learning projects, most notably Gmail Priority Inbox and the Google Knowledge Graph. I returned to Prague to co-found Eyen, a company specializing in cryo electron microscopy data analysis and special-purpose GPU development.

With Eyen working beautifully on its own, I was looking for a meaningful opportunity to push the boundaries of AI. The Diffbot’s mission fits that goal perfectly – in fact, what we’re doing here was my initial dream when I was thinking about intelligent computers as a boy: a machine that learns about the world just by observing it.

I greatly enjoy teaching my own small biological brains to be smarter than me. I play ice hockey, soccer, tennis, squash and have climbed a 6000m mountain. I also developed and installed my own smart home and enjoy making wood furniture.

 

Read More

Stories By DQL: Tracking the Sentiment of a City


The story: sentiment of news mentions of Gaza fluctuate by as much as 2000% a week. 90% of news mentions about Minneapolis have had negative sentiment through the first week in June 2020 (they’re typically about 50% negative). Positive sentiment news mentions about New York City have steadily increased week by week through the pandemic.

Locations are important. They help form our identities. They bring us together or apart. Governance organizations, journalists, and scholars routinely need to track how one location perceives another. From threat detection to product launches, news monitoring in Diffbot’s Knowledge Graph makes it easy to take a truly global news feed and dissect how entities being talked about.

In this story by DQL discover ways to query millions of articles that feature location data (towns, cities, regions, nations).

How we got there: One of the most valuable aspects of Diffbot’s Knowledge Graph is the ability to utilize the relationships between different entity types. You can look for news mentions (article entities) related to people, products, brands, and more. You can look for what skills (skill or people entities) are held by which companies. You can look for discussions on specific products.
(more…)

Read More

Stories By DQL: George Floyd, Police, and Donald Trump

We will get justice. We will get it. We will not let this door close.

– Philonise Floyd, Brother of George Floyd

News coverage this week centered on George Floyd, police, and Donald Trump. COVID-19 related news continue to dominate globally.
That’s the macro story from all Knowledge Graph article published in the last week. But Knowledge Graph article entities provide users with many ways to traverse and dissect breaking news. By facet searching for the most common phrases in articles tagged “George Floyd” you see a nuanced view of the voices being heard.

In this story hopefully you can begin to see the power of global news mentions that can be sliced and diced on so many levels. Wondering how to gain these insights for yourself? Below we’ll work through how to perform these queries in detail.

(more…)

Read More

Welcome Steve Peterson – Enterprise Account Executive

Hi everyone, I’m Steve Peterson. I’ve just joined Aron’s team to help with Enterprise Sales to Global 2000 accounts.  Having had some experience with Big Data and Analytics tools at Oracle and IBM, I’m excited to join a company with a vision to distill the entire Web into a structured format easily incorporated by businesses into their Data pipelines. This is a very exciting place to be!  I’m working on helping Diffbot become an essential piece of the data landscape for Billion dollar companies in areas like PR, Marketing, Sales and even areas like M&A.  Most companies never heard of zoom.info 3 years ago, now it’s table-stakes for most Sales orgs. I see Diffbot with a similar future, but in many parts of the Enterprise Data Stack.

I’m based in the north part of Phoenix, just past the first set of mountains heading to Flagstaff.  My wife, Cari, and I have 5 kids between us, and 5 grandkids, with one on the way (we’re thankful she had some kids early in life!).  We enjoy concerts, the pool, camping and boating.  I take my home office buddy Dallas (she’s the sweetest dog you’ve ever met — a grey-pit bull) with me on desert walks almost every morning.

 

Read More