paint-brush
The Year of the Graph Newsletter: April 2018by@linked_do

The Year of the Graph Newsletter: April 2018

by George AnadiotisApril 4th, 2018
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

<strong>Graph databases are the hottest thing around right now. Whether you are just getting started, or you are in one of the 51% of organizations already using them, this is the place to get your news and analysis.</strong>
featured image - The Year of the Graph Newsletter: April 2018
George Anadiotis HackerNoon profile picture

Graph databases are the hottest thing around right now. Whether you are just getting started, or you are in one of the 51% of organizations already using them, this is the place to get your news and analysis.

The popularity of graph databases has gone through the roof almost overnight it seems. Everything points this way: the trend lines from database engines, the reports from the Forresters of the world, the response to my graph-related ZDNet posts.

Why is that, and should you care? Do you really need a graph database, and if yes, how do you choose one? That’s the million dollar question. Well, that’s more than one questions actually, and that’s more than a million in value there too. Either way, i can help you answer, starting today.

This is the first edition of the monthly Year of the Graph newsletter. Every month i will collect, republish, and comment on the 10 most important Graph database related news items.

Why me? Ever since i implemented my first graph database prototype in 2005, i have worked on award-winning research, consulted the (then) leading vendor on distributed query implementation, and lead teams of all sizes and shapes working with graph databases.

I have also published a number of reports and articles with analysis on big data, distributed systems and analytics with Gigaom and ZDNet. Perhaps more importantly though:

I work for nobody else but me, and by extension, you. No fluff, just stuff. No automatically harvested or sponsored posts. No vendor affiliations or hidden agendas. Just hand picked, curated content, and objective, concise analysis.

  1. Why the year of the graph? Glad you asked. In a nutshell, because graph databases are coming of age and getting attention. Some of it has to do with the infrastructure and the technology enabling graph to get mainstream. The use cases have always been there, and heavyweights like AWS and Microsoft are moving in this space.


The year of the graph: Getting graphic, going native, reshaping the landscape | ZDNet_It's official: graph databases are a thing. That's the consensus here on Big on Data among fellow contributors Andrew…_www.zdnet.com

2. How do you model you graph? This is the question Dilyan Damyanov from Snowplow explores. Using graphs is part of what Snowplow does, with an emphasis on managing events. The analysis on different answers on how to model event data as a graph sheds some light on the fine art of modelling graphs.


Building a model for event data as a graph - Snowplow_In recent months we've been busy expanding the variety of storage targets available for Snowplow users to load Snowplow…_snowplowanalytics.com

3. So, you have a Hadoop data lake. There’s graph data in there, and “graphy” queries you can do on that data. ArangoDB’s Max Neunhöffer gives examples of graph use cases, and provides an introduction to some graph algorithms you can use, as well as a walkthrough of how to get started with ArangoDB.


Fishing for graphs in a Hadoop data lake_Graphs in the sense of linked data with vertices and edges are everywhere these days: social networks, computer…_www.oreilly.com

4. Graph is a much more natural and efficient paradigm for doing multiple joins (hops) than relational data. Furthermore, graphs also work well together with another mega-trend, machine learning. The inherent structure in graphs can be leveraged in your machine learning algorithms, as Graphistry’s Leo Meyerovich discusses with O’Reilly’s Ben Lorica.


Graphs as the front end for machine learning_The O'Reilly Data Show Podcast: Leo Meyerovich on building large-scale, interactive applications that enable visual…_www.oreilly.com

5. In their original inception data lakes grant universal access to data in their native formats, yet lack the necessary metadata and semantic consistency for long term sustainability. What can give you that? The combination of enterprise-wide ontologies, taxonomies, and terminology, says Franz Inc’s Jan Aasman.


Harmonizing big data with an enterprise knowledge graph_One of the most significant results of the big data era is the broadening diversity of data types required to solidify…_www.infoworld.com

6. Graph databases come in 2 main flavors. RDF is one of them, and Linked Data is a collection of related standards, including taxonomies, that enable graph-based navigation and querying. If you’re interested in the foundamendals of this approach, this account of the Network for Information and Knowledge Exchange’s workshop may be for you.


Making true connections in a complex world - Graph database technology and Linked Open Data_Conrad Taylor writes: The first NetIKX meeting of 2018, on 25 January, looked at new technologies and approaches to…_netikx.wordpress.com

7. If you think a taxonomy sounds fancy, how about an ontology? As Teodora Petkova from Ontotext argues, this is not about a philosophical debate on the essence of being. In order to define what something “is” (to a computer program), information technology resorts to the use of ontologies. Some graph databases use those — this is how you build a knowledge graph.


Ontology, Meaning and Knowledge Management_A program that wants to compare or combine information across the two databases has to know that these two terms are…_ontotext.com

8. The other graph database flavor is LPG — Labelled Property Graphs. One of the differences between RDF and LPG is support for taxonomies and ontologies, and as a consequence of this, inference: RDF has it, LPG does not. Or at least, that was the case up to now, says Thorsten Liebig from derivo GmbH. Their solution wants to support RDF-like reasoning on LPG.


Neo4j: A Reasonable RDF Graph Database & Reasoning Engine [Community Post]_Discover how the Neo4j graph database and the Cypher graph query language perform as an OWL reasoning engine when…_neo4j.com

9. Titan, an open source LPG graph database, was an important piece of the graph database world. As Titan is no longer maintained, IBM and others have been supporting a group of people who stepped up and adopted its codebase, forking it as JanusGraph. Ted Wilmes from Expero is one of the architects of JanusGraph, and he recently spoke about the state of JanusGraph in 2018.

10. What is the current status quo in the graph database world? All the latest developments from AWS, Cambridge Semantics, Neo4j and Tigergraph, in the “best overview of the graph database world(s) so far“, plus analysis on the options for querying graphs.


Back to the future: Does graph database success hang on query language? | ZDNet_Fifty years ago, relational databases were neither ubiquitous nor standardized. The story of how we went from the…_www.zdnet.com

Would you like to receive the latest Year of the Graph Newsletter in your inbox? Easy — just signup below. Have some news you think should be featured in an upcoming newsletter? Easy too — drop me a line here.

Originally published at http://linkeddataorchestration.com/2018/04/03/yearofthegraph/