Geo Location
Geo location is something we often need in ETL work. And while we had a step that worked in PDI 5.x and earlier releases, we just noticed it's not currently working.
Until this morning, that is :p
I just forked Matt's initial project and applied the relevant changes to make it compatible with Pentaho 6+
The basics
Well, easy to understand... We have an IP address, we want to know where it comes from!
Once I execute this, I get the following result:
I am indeed in Porto Salvo, Portugal, so this is right. Can't get any easier than this!
Making it work
So, how to make this work? First, you have to get the plugin from the PDI marketplace
This plugin is available through the marketplace. Just go ahead and install it.
After installing it and restarting PDI, you'll see the GeoIP Lookup step in the lookup folder. Configuring it is straightforward: You point to the stream field containing the IP address, point to the IP database files and specify what fields you want back:
Getting the IP Database files
You need to get the files from MaxMind, and from my experience these guys do a great job here. They have some great commercial offerings but also a GeoLite database for country and city location. You can get them from here
And you should be done! This even works great in a map reduce job
More...
Geo location is something we often need in ETL work. And while we had a step that worked in PDI 5.x and earlier releases, we just noticed it's not currently working.
Until this morning, that is :p
I just forked Matt's initial project and applied the relevant changes to make it compatible with Pentaho 6+
The basics
Well, easy to understand... We have an IP address, we want to know where it comes from!
Once I execute this, I get the following result:
I am indeed in Porto Salvo, Portugal, so this is right. Can't get any easier than this!
Making it work
So, how to make this work? First, you have to get the plugin from the PDI marketplace
This plugin is available through the marketplace. Just go ahead and install it.
After installing it and restarting PDI, you'll see the GeoIP Lookup step in the lookup folder. Configuring it is straightforward: You point to the stream field containing the IP address, point to the IP database files and specify what fields you want back:
Getting the IP Database files
You need to get the files from MaxMind, and from my experience these guys do a great job here. They have some great commercial offerings but also a GeoLite database for country and city location. You can get them from here
And you should be done! This even works great in a map reduce job
More...