Using Apache Drill to Access Parquet Files in PowerBI
When you work with data, especially larger data sets, you will come across Parquet files. Parquet is a binary, columnar storage format that is efficient for several large-data use cases, in terms of both compression and speed.
How AI Is Aiding Repression
Highly recommended reading - How Artificial Intelligence is Reshaping Repression
Some points of note:
“Around the world, AI systems are showing their potential for abetting repressive regimes and upending the relationship between citizen and state, thereby accelerating a global resurgence of authoritarianism.”
AI technology, primarily in the form of facial recognition, is being adopted by security forces, driven in part by investment from China.
Zimbabwe is implementing facial recognition - this is a country that recently carried out large post-election crackdowns.
AI and the Art of Lying
Interesting article - “Will AI end the Art of Lying?”
Some points of note:
Swarm Intelligence: the concept that many minds are better than one is being explored from an AI perspective, with interesting results.
Prepare for Exam 70-774 - Performing Cloud Data Science with Azure Machine Learning
I recently passed Exam 70-774 - Performing Cloud Data Science with Azure Machine Learning and thought it may be helpful to provide some guidance on how best to prepare for the exam.
Building A Great Data Lake (or How to Avoid a Data Swamp)
What is a Data Lake?
Data Lake is a term that gets thrown around a fair amount, often in conjunction with big data. But what does it really mean? At its core, it's a central repository for storing unlimited amounts of data from many different sources, on top of which you can bring analytics to bear to gain insights.