PinnedPublished inTowards Data ScienceDelta lake with Spark: What and Why?Get to know the storage layer which enabled ACID and updates with SparkAug 22, 20202Aug 22, 20202
Published inGeek CultureFinding the latest date is not as easy as you would thinkUnderstanding how to find the latest value in a date partition column in SparkJul 18, 2022Jul 18, 2022
Systematic Sampling with SparkUnderstanding systematic sampling and its implementationJul 17, 2022Jul 17, 2022
Published inTowards Data ScienceDo Real-Time Data Pipelines Even Exist?Sharing a fresh perspective on real-time data pipelinesJul 12, 20223Jul 12, 20223
Published inTowards Data ScienceGetting hands-on with DBT — Data Build ToolStep by Step Guide to running your first project with DBTJul 5, 20222Jul 5, 20222
Published inTowards Data ScienceStop using the LIMIT clause wrong with SparkUnderstanding spark LIMIT and its performance with large datasetsMay 22, 20221May 22, 20221
Published inGeek CultureShould you use singleton objects in Scala?Understanding singleton objects in ScalaMay 14, 20221May 14, 20221
Published inGeek CultureHow to build a simple text-to-speech converter?Guide on building text to speech converter in PythonFeb 25, 2022Feb 25, 2022
Published inGeek CultureWhat is Data-as-a-Product?Understanding one of the founding principles of Data MeshFeb 23, 2022Feb 23, 2022
Do you need a Macbook to visit Starbucks?A hilarious encounter of my sister’s first visit to StarbucksFeb 16, 20221Feb 16, 20221