This is the brief write-up for the Introduction to Bloom training session held on 9th June 2021. You can watch a recording of the session here. Alternatively, you should be able to follow along using the repo as well.
Other useful links:
We use the Kaggle Olympics Data Set for this example. Specifically, the dataset contains information from the Summer and Winter Olympics from 1896 to 2016.
Due to the limitations in Neo4j Aura Free tier on the number of nodes and relationships permitted, we are only using the Winter Games dataset. Not using Neo4j Aura Free? Feel free to remove the 'winter' filter to access the full data set.
You have a choice of databases, including:
-
Neo4j Aura (including free tier)
-
Neo4j Sandbox
-
Neo4j Desktop
-
etc.
For Neo4j Aura Free:
-
Sign in & click “Create a database”
-
Give your database a name
-
Selected “Shared” database size
-
Click “Create Database”
-
Make a copy of the generated password - keep it safe!
For Neo4j Sandbox:
-
Sign in & click “Blank sandbox”
What’s the difference between Aura and Sandbox?
Neo4j Sandbox is a time-limited instance of the Neo4j database, it lasts for a maximum of 10 days (when extended, otherwise three days without extension), whereas Neo4j Aura free is free for life.
-
Copy all the queries from here
-
Click on open/Neo4j Browser to start the Browser
-
Username: neo4j
-
Paste the query into the query window and press the play button to start
New to Neo4j Browser? Check out this developer guide.
Start the Bloom App. You will be prompted to auto-generate a perspective.
Explore the auto-generated perspective:
-
What categories have been loaded?
-
What category properties have been selected to display?
Try creating your own perspective:
-
Click on ‘Choose a Perspective’ -→ ’Create a Perspective’
-
Add the categories
-
Select properties to display
-
Switch back to the auto-generated perspective
Look up Maxime Dufour-Lapointe:
-
Explore the information about Maxime Dufour-Lapointe using the Card List
-
Try right-clicking the node and expand
-
What more do we learn about Maxime Dufour-Lapointe?
-
Expanding the Team node (cross-check the colour with the Category list on the right), what more do we learn?
Solutions
Click to see solution
Maxime Dufour-Lapointe was part of the Canada Freestyle Skiing team, and competed with her siblings Chlo Dufour-Lapointe and Justine Dufour-Lapointe!
How would you find the answer to: What Olympic games was Yelena Dubok a part of?
-
How many different patterns can you come up with that provide the answer?
Click to see solution
Lots of different ways, including
-
Yelena Dubok team games
-
Yelena Dubok part of competed
-
Yelena Dubok team competed in
-
Yelena Dubok part of team competed
-
etc.
-
How many Gold medals has Claudia Pechstein won?
-
How many teams has Lamine Guye been in?
-
Can you find some athletes that have represented more than one country?
-
What different ways are Olivier Jenot and Patrice Servelle linked? (hint: think about the paths that may link them)
Click to see solution
-
Claudia Pechstein has won 8 Gold medals
-
Lamine Guye has been in 8 teams
-
Country Team Athlete Team Country helps us find athletes who have represented more than one country
-
Olivier Jenot and Patrice Servelle linked through same games (Torino and Sochi) and country (Monaco)
Wouldn’t it be nice to find Athletes without knowing their middle names? Let’s add a search phrase for that!
Add search phrases to find names
-
Cypher code from here
-
Test out your search phrases!
-
Add a relationship between City and Country for Cities that have held the Olympics more than once
-
Feel free to search the City locations if you don’t know them off hand!
We are going to use the Page Rank scores we have on Athlete to Athlete, based on shared Teams.
First of all, why not add some appropriate icons for all the nodes in use?
Using the dynamic sizing option, use the smallest and largest node sizes possible. The following range values may be useful:
-
Min value: 0.15
-
Max value: 4.85
Click to see solution
Samuel Bode Miller is the 'most connected' athlete, with a Page Rank score of 4.82