LORA: LOW-RANK ADAPTATION OF LARGE LANGUAGE MODELS
Many applications in natural language processing rely on adapting one large-scale, pre-trained language model to multiple downstream applications.
Tue Jan 14 2025
Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark
Can MLLMs effectively serve as judges in the multimodal domain, and how closely do their evaluations align with human preferences?
Sat Jan 11 2025
Latent Dirichlet Allocation - Part.1
LDA's main purpose is not dimensionality reduction but topic modeling. Borrowing the paper's wording, topic modeling is defined as follows:
Thu Dec 07 2023
From Seq2Seq to Transformer - Part.2
The transformer model represents another significant leap in the NLP field after the Seq2Seq model and the attention mechanism.
Sat Oct 07 2023
From Seq2Seq to Transformer - Part.1
The attention mechanism was proposed to improve the Seq2Seq model. To understand the attention mechanism well, you must understand the Seq2Seq model.
Wed Oct 04 2023
Generative AI - Part.2
The learning and generation principles of Diffusion Models, which model multi-layer transformations and inverse transformations between data distribution and noise distribution, share many similarities with Flow Models.
Thu Jul 13 2023
Generative AI - Part.1
So, on what principle do generative models, the core of generative AI, learn and operate? What does 'generation' mean? In statistics and machine learning, generation is defined as sampling, from a probability distribution over the data space, data that was not observed or used during the learning process.
Wed Jul 12 2023
How Ray Tracing works Part.1
Today we'll look at ray tracing, the dominant technique used in the graphics industry.
Sun Oct 02 2022
Image Style Transfer (CNN)
Paper Review of Image Style Transfer Using Convolutional Neural Networks.
Sat Jan 29 2022
02. MLOps: Model Monitoring - Weights and Bias
Basics of MLOps using a basic vision project.
Sun Jan 23 2022
Physics-based Human Motion Estimation and Synthesis from Videos
Paper review of Physics-based Human Motion Estimation and Synthesis from Videos
Wed Jan 19 2022
01. MLOps: Project Settings
Basics of MLOps using a basic vision project.
Sun Jan 16 2022
Gradient Descent
The gradient descent method is also called the steepest descent method.
Sat Oct 16 2021
Adaptive Moment Estimation (ADAM)
ADAM combines Gradient Descent with Momentum and RMSProp in a single update rule.
Sat Oct 09 2021
Linear Regression
Linear regression is a linear approach for modeling the relationship between a scalar response and one or more explanatory variables.
Sat Oct 09 2021
KL Divergence
KL (Kullback-Leibler) Divergence, also known as Relative Entropy, measures the difference between two probability distributions.