# StataCosineSimilarity **Repository Path**: Shutter_Zor/StataCosineSimilarity ## Basic Information - **Project Name**: StataCosineSimilarity - **Description**: 用Stata计算余弦相似性 - **Primary Language**: Unknown - **License**: MIT - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2022-06-04 - **Last Updated**: 2022-06-06 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # StataCosineSimilarity Calculating cosine similarity with Stata $$ cosine\ similarity = \frac{\boldsymbol{A}\cdot\boldsymbol{B}}{||\boldsymbol{A}||\times ||\boldsymbol{B}||}=\frac{\sum A_i \times B_i}{\sqrt{\sum (A_i)^2}\sqrt{\sum (B_i)^2}} $$ # example 句子A:$[0,1,1,0]$ 句子B:$[1,2,2,1]$ $$ cs = \frac{0\times1+1\times2+1\times2+0\times1}{\sqrt{0^2+1^2+1^2+0^2}\times\sqrt{1^2+2^2+2^2+1^2}}=\frac{4}{\sqrt{2\times10}}=2/\sqrt{5}\approx 0.894 $$