Hackernoon logoThe Year of the Graph Newsletter: April 2018 by@linked_do

The Year of the Graph Newsletter: April 2018

George Anadiotis Hacker Noon profile picture

@linked_doGeorge Anadiotis

George's got tech, data, and media, and he's not afraid to use them.

Graph databases are the hottest thing around right now. Whether you are just getting started, or you are in one of the 51% of organizations already using them, this is the place to get your news and analysis.

The popularity of graph databases has gone through the roof almost overnight it seems. Everything points this way: the trend lines from database engines, the reports from the Forresters of the world, the response to my graph-related ZDNet posts.

Why is that, and should you care? Do you really need a graph database, and if yes, how do you choose one? That’s the million dollar question. Well, that’s more than one questions actually, and that’s more than a million in value there too. Either way, i can help you answer, starting today.

This is the first edition of the monthly Year of the Graph newsletter. Every month i will collect, republish, and comment on the 10 most important Graph database related news items.

Why me? Ever since i implemented my first graph database prototype in 2005, i have worked on award-winning research, consulted the (then) leading vendor on distributed query implementation, and lead teams of all sizes and shapes working with graph databases.

I have also published a number of reports and articles with analysis on big data, distributed systems and analytics with Gigaom and ZDNet. Perhaps more importantly though:

I work for nobody else but me, and by extension, you. No fluff, just stuff. No automatically harvested or sponsored posts. No vendor affiliations or hidden agendas. Just hand picked, curated content, and objective, concise analysis.

  1. Why the year of the graph? Glad you asked. In a nutshell, because graph databases are coming of age and getting attention. Some of it has to do with the infrastructure and the technology enabling graph to get mainstream. The use cases have always been there, and heavyweights like AWS and Microsoft are moving in this space.

2. How do you model you graph? This is the question Dilyan Damyanov from Snowplow explores. Using graphs is part of what Snowplow does, with an emphasis on managing events. The analysis on different answers on how to model event data as a graph sheds some light on the fine art of modelling graphs.

3. So, you have a Hadoop data lake. There’s graph data in there, and “graphy” queries you can do on that data. ArangoDB’s Max Neunhöffer gives examples of graph use cases, and provides an introduction to some graph algorithms you can use, as well as a walkthrough of how to get started with ArangoDB.

4. Graph is a much more natural and efficient paradigm for doing multiple joins (hops) than relational data. Furthermore, graphs also work well together with another mega-trend, machine learning. The inherent structure in graphs can be leveraged in your machine learning algorithms, as Graphistry’s Leo Meyerovich discusses with O’Reilly’s Ben Lorica.

5. In their original inception data lakes grant universal access to data in their native formats, yet lack the necessary metadata and semantic consistency for long term sustainability. What can give you that? The combination of enterprise-wide ontologies, taxonomies, and terminology, says Franz Inc’s Jan Aasman.

6. Graph databases come in 2 main flavors. RDF is one of them, and Linked Data is a collection of related standards, including taxonomies, that enable graph-based navigation and querying. If you’re interested in the foundamendals of this approach, this account of the Network for Information and Knowledge Exchange’s workshop may be for you.

7. If you think a taxonomy sounds fancy, how about an ontology? As Teodora Petkova from Ontotext argues, this is not about a philosophical debate on the essence of being. In order to define what something “is” (to a computer program), information technology resorts to the use of ontologies. Some graph databases use those — this is how you build a knowledge graph.

8. The other graph database flavor is LPG — Labelled Property Graphs. One of the differences between RDF and LPG is support for taxonomies and ontologies, and as a consequence of this, inference: RDF has it, LPG does not. Or at least, that was the case up to now, says Thorsten Liebig from derivo GmbH. Their solution wants to support RDF-like reasoning on LPG.

9. Titan, an open source LPG graph database, was an important piece of the graph database world. As Titan is no longer maintained, IBM and others have been supporting a group of people who stepped up and adopted its codebase, forking it as JanusGraph. Ted Wilmes from Expero is one of the architects of JanusGraph, and he recently spoke about the state of JanusGraph in 2018.

10. What is the current status quo in the graph database world? All the latest developments from AWS, Cambridge Semantics, Neo4j and Tigergraph, in the “best overview of the graph database world(s) so far“, plus analysis on the options for querying graphs.

Would you like to receive the latest Year of the Graph Newsletter in your inbox? Easy — just signup below. Have some news you think should be featured in an upcoming newsletter? Easy too — drop me a line here.

Originally published at http://linkeddataorchestration.com/2018/04/03/yearofthegraph/


Join Hacker Noon

Create your free account to unlock your custom reading experience.