Parse zipped PostgreSQL's logs and save them in a parquet file

I'm administrating a large number of PostgreSQL's servers and I get their logs zipped. To analyze them I've done a Spark task for:

1. Unzip the files
2. Parse then logs of PostgreSQL
3. Save (append) the data into a parquet file

In a following post I will show how to query them to get usefull . . .

October 25, 2018

Problem with weighted indexes

One problem with weighted indexes is that few components of the index can move its value when the value of few components is much bigger than the others. That could give misleading conclusions. For example, when small weighted components are not following the trend of the big ones. Some scenarios where . . .

October 09, 2018

Print Markdown in the HTML widget using Markdown package

I've uploaded a Jupyter Notebook in Github explaining two ways to print Markdown in a Jypyter Notebook:

The first option is straightforward, but the second one is much more powerful because can be used with other widgets, . . .

August 17, 2018

Export data from Oracle to MongoDB in Python

Introduction

I had to export some data from an Oracle database to a MongoDB. For this reason I created a python function called export_data_from_oracle_to_mongodb that can be found in my Github.

To make the function more generic, I've there's an optional parameter called transform,where a function can be specified to . . .