WEEK 5: ENTRY 3

Today at my internship, I worked on activating my Big Query Console within the firm’s authentication and authorization and testing out some of the Queries on the data from the Big Query API. Big Query is a Google Cloud Platform, which is public as well. However, unlike how I can create an account to sign in to the Big Query Console, I had to use my company’s email to access the data housed in the Big Query. The Big Query is quite fascinating because it has the description of the data, the schemas and data types all outlined.  In addition, the console and executable run time is pretty fast and simple to use. 


Like many databases and data, my data is in a relational database format. It is basically arranged in a set of rows and columns. This display eases the struggle of writing the query. One of the queries that my manager needed me to write was , “ A query that returns every host(distinct) and the number of system logs that contain a certain body massage.” Having worked with relational databases in my Database class last semester, it was a very similar approach in using the SELECT statements and other functions when writing SQL Queries. However, there were new syntax and commands like the “LIKE” functions. As a result, I had to spend some moments looking at the documentations and experimenting with those new SQL commands that are apt for the Big Query queries. 


At first, I was able to return the host names and the count of each host name with the body as well, but it was thousands of rows because there were duplicates. Why did I see duplicates? Well, it is because I was not including in my query any command to handle the duplicates. Perhaps I should have expected this prior to running my query, but I did not. To resolve this issue, I added the “DISTINCT” command and “GROUP BY” so that the hot names would not be duplicated. Luckily it worked for me. I have learned that most of what I learned in my database class at Berea College were really handy and I definitely needed to just be more confident and recall what I learned. I wish I had the notes I made during that class with me here, but I would not let that deter my progress and success in implementing the queries because most of the information is in the web.


Comments

Popular posts from this blog

WEEK 4: ENTRY 3

WEEK 6: ENTRY 3

WEEK 7: ENTRY 1