jaime.lucero

Home » Collections for Jaime Lucero

About Jaime Lucero

Jaime is a Bachelor of Science in Computer Science major in Data Science student at the University of Southeastern Philippines. His journey is driven by the goal of becoming a developer specializing in machine learning and AI-driven solutions that create meaningful impact.

How to Generate Simple Document Embeddings with Python

2025-12-10T05:58:07+00:00

Document embeddings are one of the simplest ways to give machines an understanding of text, and in our previous article, Document Embeddings Explained: A Guide for Beginners, we explored how they turn entire documents into dense numerical vectors that capture meaning and context. Now that you understand what embeddings are and why they’re useful for tasks like semantic search, classification, and clustering, this tutorial will show you how to generate them in practice using Python. Whether you’re working with short paragraphs, long articles, or a collection of documents, the steps in this guide will help you create embeddings that you [...]

How to Generate Simple Document Embeddings with Python2025-12-10T05:58:07+00:00

Document Embeddings Explained: A Guide for Beginners

2025-12-08T05:12:54+00:00

Every day, billions of lines of text, emails, articles, and messages are created online. Making sense of all this unstructured data is one of the toughest challenges in modern AI. Document embedding is a fundamental concept that overcomes this problem. These are dense, numerical vectors that transform words, sentences, or entire documents into meaningful points in a high-dimensional space. These vectors capture the meaning and context of the original text. Because of this, machine learning models can measure similarity and perform tasks like topic classification, semantic search, and recommendation. What are Document Embeddings? Document embeddings convert text into numerical representations, [...]

Document Embeddings Explained: A Guide for Beginners2025-12-08T05:12:54+00:00

Data Preprocessing Guide for Beginners in ML

2025-10-22T06:09:23+00:00

Before machine learning (ML) models can generate predictions or insights, the raw data must first be cleaned, organized, and transformed into a suitable format for the model. This process is known as data preprocessing. It is the foundation of every successful ML project. It ensures that the model learns from high-quality, consistent, and well-structured input rather than noisy, incomplete, or biased information. In this hands-on guide, we’ll walk through how to transform a raw Kindle eBook dataset from Kaggle into machine learning-ready data using Google Colab, a free cloud-based environment that allows you to write and execute Python code directly [...]

Data Preprocessing Guide for Beginners in ML2025-10-22T06:09:23+00:00

AWS, Azure, and GCP Certifications are consistently among the top-paying IT certifications in the world, considering that most companies have now shifted to the cloud. Upskill and earn over $150,000 per year with an AWS, Azure, or GCP certification!

Follow us on LinkedIn, Facebook, or join our Slack study group. More importantly, answer as many practice exams as you can to help increase your chances of passing your certification exams on your first try!