Category: Neo4J

Neo4j 5 Error Messages

Recently we noticed some customers seeing these types of warning messages in their debug.log file: These warnings occur if these additional JVM settings are missing from neo4j.conf. Without them, the JVM is less able to manage memory under contention, and as a result you see errors begin to appear in the […]

Neo4j – H3 Datasets

H3 allows us to help make sense of large amounts of data. For this blog series, we will use the NYC Taxi Data set and add in the NYC Taxi Zones, New York Counties, NYC Boroughs and NYC Buildings. I also added in Open Street Map POI data using the Python notebook from my colleague […]

Neo4j – H3 Library – Update

Post author By dave fauth
Post date February 4, 2023
Categories In Geospatial, Neo4J

After about four years (where did that time go?), I have circled back to Neo4j and H3 geospatial data processing. Since version 3.4, Neo4j has had a native geospatial datatype. Neo4j supports the WGS-84 and WGS-84 3D coordinate reference systems. Within Neo4j, we can index these point properties and query using the distance function or query within a bounding […]

Neo4j – Uber H3 – Geospatial

Post author By dave fauth
Post date February 19, 2019
Categories In Data Analysis, Data Modeling, Geospatial, Neo4J, Search, Uncategorized

We are going to take a slight detour from the healthcare blog series and talk about Uber H3. H3 is a hexagonal hierarchical geospatial indexing system. It comes with an API for indexing coordinates into a global grid. The grid is fully global and you can choose your resolution. The advantages and disadvantages […]

Neo4j – Leveraging a graph for healthcare search

Post author By dave fauth
Post date February 18, 2019
Categories In Data Analysis, Neo4J, Search

Graph-based search is intelligent: You can ask much more precise and useful questions and get back the most relevant and meaningful information, whereas traditional keyword-based search delivers results that are more random, diluted and low-quality. With graph-based search, you can easily query all of your connected data in real time, then focus on the answers […]

Modeling events in Neo4j to look for patterns

Post author By dave fauth
Post date February 18, 2019
Categories In Data Modeling, Neo4J

Recently, some of the prospects that I work with have wanted to understand event data and felt like a graph database would be the best approach. Being new to graphs, they aren’t always sure of the best modeling approach. There are some resources available that talk about modeling events. For example, Neo4j’s own Mark Needham […]

Accessing Hive/Impala from Neo4j

Post author By dave fauth
Post date November 16, 2015
Categories In Hadoop, Impala, Neo4J, Uncategorized

Quite frequently, I get asked about how you could import data from Hadoop and bring it into Neo4j. More often than not, the request is about importing from Impala or Hive. Today, Neo4j isn’t able to directly access an HDFS file system so we have to use a different approach. For this example, we will […]

Neo4j – New Neo4j-import

Neo4j has recently announced the 2.2 Milestone 2 release. Among the exciting features is the improved and fully integrated “Superfast Batch Loader”. This utility, (unsurprisingly) called neo4j-import, now supports large-scale non-transactional initial loads (of 10M to 10B+ elements) with sustained throughput of around 1M records (node, relationship, or property) per second. Neo4j-import is available […]

Hadoop, Impala and Neo4J

Post author By dave fauth
Post date March 12, 2014
Categories In Hadoop, Neo4J, opendata, Uncategorized

Back in December, I wrote about some ways of moving data from Hadoop into Neo4J using Pig, Py2Neo and Neo4J. Overall, it was successful although maybe not at the scale I would have liked. So this is really attempt number two at using Hadoop technology to populate a Neo4J instance. In this post, I’ll use […]