2024 Jun 12 2:22 PM - edited 2024 Jun 19 8:25 PM
This discussion thread is to submit your solution for Week 2 "Words as Vectors" if you are participating in the June 2024 developer challenge "Multi-model in SAP HANA Cloud" 🤓
👉Please use this separate thread to ask your questions and discuss issues. Keep this thread for your submissions only.
1️⃣ Post your Week 1 "Setup" submissions here: https://community.sap.com/t5/application-development-discussions/submissions-for-quot-sap-hana-cloud...
3️⃣ Post your Week 3 "Image Embeddings" submissions here: https://community.sap.com/t5/application-development-discussions/submissions-for-quot-sap-hana-cloud...
2024 Jun 12 10:12 PM
My submission for week 2. I tried several sets of words because the query did not always return my expected results.
2024 Jun 13 8:30 AM
Thank you for trying it out @MioYasutake
How did you find this week's content? Was it helpful to learn anything new?
2024 Jun 13 9:03 AM
@Vitaliy-R
Executing the scripts was straightforward, and I didn't encounter any obstacles. However, I believe it would be more beneficial for learning if there were elements that required us to think and implement solutions on our own.
2024 Jun 13 9:16 AM
Thank you for your feedback @MioYasutake and @Cocquerel !
It seems I overestimated the difficulty of running this, as I assumed many people would be trying different aspects of this exercise for the very first time and I wanted to make it easier.
I see it has become too easy and resulted in a tutorial instead of a challenge...
Ok, I will try to make it more challenging next week 🤓
2024 Jun 12 10:47 PM
Here is mine.
2024 Jun 13 8:30 AM
Thank you for giving it a try @Cocquerel
How did you find this week's content? Was it helpful to learn anything new?
2024 Jun 13 9:01 AM
The task was extremely guided; the only difficulty was finding words that yielded a meaningful result. I would have preferred a slightly less guided exercise because I retain information better when I have to search a bit on my own rather than simply executing tasks without any reflection. I would have preferred something that looks more like a real challenge.
2024 Jun 13 9:26 AM - edited 2024 Jun 13 5:42 PM
Point taken. On top of what I replied to you and @MioYasutake below already, I must confess it took me 4 hours to make the 3CosMul calculation work:
((1+ COSINE_SIMILARITY("V4"."WV", "V3"."WV"))/2 * (1+COSINE_SIMILARITY("V4"."WV", "V2"."WV"))/2) / ((1+COSINE_SIMILARITY("V4"."WV", "V1"."WV"))/2 + 0.000001) AS "3COSMUL_SCORE"
for the analogy queries.
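For anyone who wants to experiment with the scoring outside of SQL, here is a minimal Python sketch of the same expression. It is only an illustration under assumptions: the function names and the toy 2-dimensional vectors are made up, and the argument names simply mirror the "V1".."V4" aliases from the SQL above without claiming which word each alias holds in the notebook.

```python
import math

EPSILON = 0.000001  # same small constant as in the SQL expression, avoids division by zero


def cosine_similarity(a, b):
    """Plain cosine similarity of two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def shifted(a, b):
    """Map cosine similarity from [-1, 1] to [0, 1], like (1 + COSINE_SIMILARITY(...)) / 2."""
    return (1 + cosine_similarity(a, b)) / 2


def three_cos_mul(v4, v3, v2, v1):
    """Score a candidate vector v4: reward closeness to v3 and v2, penalize closeness to v1."""
    return shifted(v4, v3) * shifted(v4, v2) / (shifted(v4, v1) + EPSILON)


# Toy vectors (hypothetical, not real word embeddings).
toy_v1 = [0.0, 1.0]
toy_v2 = [1.0, 0.0]
toy_v3 = [1.0, 0.0]
toy_candidate = [1.0, 0.0]

toy_score = three_cos_mul(toy_candidate, toy_v3, toy_v2, toy_v1)
```

The shift to [0, 1] keeps every factor non-negative, so a strongly negative similarity in the denominator cannot flip the sign of the score.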
Sounds like I should have left it to you as a challenge next time 🤓
2024 Jun 13 8:30 AM
Hello, here is my Week 2 submission:
regards
Sagar
2024 Jun 13 8:32 AM
Have you considered a career in medicine @sagarsv ? 🩺
How did you find this week's content? Was it helpful to learn anything new?
2024 Jun 13 12:09 PM
HAHA, NO @Vitaliy-R
I felt the same, it was too straightforward.
For me, "COSINE_SIMILARITY" is very new. I understand a bit of what it does, but I need to dive deeper, which I will 🙂
2024 Jun 13 8:34 AM
My submission for week 2
2024 Jun 13 8:52 AM
Your last example made me scratch my head, and then smile @mvinci
I checked that EDE is the NYSE's "Empire District Electric Company". So, this is where "Empire" is coming from 🙂
Maybe you should have tried what is there in "Athens" similar to "pasta" in "Rome"? 😋
How did you find this week's content? Was it helpful to learn anything new?
2024 Jun 13 9:09 AM
Excellent exercise - maybe too easy as the notebooks were super cool and prompt (too perfect) 😀
Only one query had an error (I am investigating the error in my account). I loaded 3M records on my BTP internal tenant (not trial...)
2024 Jun 13 9:23 AM
Thank you for your feedback @mvinci and it is in line with what @MioYasutake and @Cocquerel wrote. Note taken: https://community.sap.com/t5/application-development-discussions/submissions-for-quot-sap-hana-cloud...
I reduced the number of loaded records from 3000000 to 100000 because, for 3 million records, I used a HANA instance bigger than what we all have in the Trial. Plus, it takes quite a long time for single-threaded processing to work through a 3M dataset. It was also at the peak of what BAS trial dev spaces can handle in terms of disk and RAM.
Maybe that can be taken as a challenge now: how to tune the environment to work with the complete 3M dataset in Trial 🤓
2024 Jun 13 1:59 PM
I fixed the "cannot allocate enough memory..." error by increasing the statement_memory_limit_threshold up to 80% in the global.ini file, section: memorymanager.
2024 Jun 13 9:11 AM
If Rome has pasta, Athens has noodles...
2024 Jun 13 9:45 AM
🤣
2024 Jun 13 10:17 AM
2024 Jun 14 7:53 AM
Hi,
Week 2 submission, but I was expecting different relationships 😄
2024 Jun 14 4:27 PM
Your result:
sky --> cloud
ground --> VMs
made my day! 🤓
2024 Jun 15 6:58 PM
My submission for week 2
I tried several sets of words because the query did not always return results, and some outcomes I did not understand. After increasing the number of results, some of them became a bit more reasonable.
word1='Car'
related_word1='mobility'
word2='Train'
lookup_word | related_word | 3COSMUL_SCORE |
Train | fluidity | 0.8257285345399955 |
Train | transport | 0.8233577046806748 |
Train | movement | 0.79910186795009 |
Train | disabilities | 0.7945178072857146 |
Train | train | 0.7854194316740297 |
Дякую! (Thank you!)
Have a nice weekend,
Dirk
2024 Jun 15 8:51 PM
Thanks for joining @DirkO
It seems "mobility" semantically represents the medical term of "being able to move", plus "train" might be taken as a verb for "training" 🙂
But when I changed "mobility" to "Mobility" then related words were:
2024 Jun 16 4:33 PM
Week 2 Submission! Having fun trying different words!
SPOILER ALERT! It looks like Grogu is not Yoda's son!!! Nick is 😮
🤔
2024 Jun 17 8:38 AM
Funny examples indeed.
In the case of Jupiter, it referred to the city in Florida, whose neighbor is Palm Beach. This is where SAP TechEd on Tour takes place alongside ASUG Tech Connect in November https://events.asug.com/event/9a5f27ca-d742-45bb-85e7-a6597b952900/summary, but who knows, maybe we will have it on the Moon still during our lifetime 🤓
And in the case of name+surname for Luke, it might be related to the name+surname of the music producer Nick Yoda.
2024 Jun 16 5:57 PM
2024 Jun 16 10:45 PM
2024 Jun 17 7:59 AM
My week 2 submission:
USA | Washington_DC | 0.7220002249341672 |
which is the correct capital city
But when word2 = 'Spain'
We get
Spain | Catalan | 0.865680415938654 |
M
2024 Jun 17 9:38 AM
2024 Jun 17 2:25 PM - edited 2024 Jun 17 2:26 PM
🍻
2024 Jun 17 11:00 AM
Hello,
Week-2 submission:
2024 Jun 17 9:51 PM
2024 Jun 18 11:01 AM
Hi @Leela_Sankar_PV, have you tried any other combinations of words to come up with your own analogy questions? This is what was asked at the end of the challenge for Week 2: to replace these values:
word1='king'
related_word1='man'
word2='queen'
and post a screenshot with your example.
2024 Jun 19 7:26 AM
Hi @Vitaliy-R, yes, I was using different combinations. Now I have replaced them with the one below, and the result is:
word1='king' related_word1='man' word2='queen'
2024 Jun 17 10:02 PM
Hi Vitaliy, all worked fine, also when using 500000 words. It was also interesting to show more than the LIMIT 1 result.
A little bit stranger:
word1='Munich'
related_word1='Oktoberfest'
word2='Stuttgart'
(It is not "Septemberfest", but "Cannstatter Wasen", although it usually takes place in September.)
Experimenting with capitals worked for Brasil, Brasilia -> Australia, Canberra, but not for Suisse, Bern.
2024 Jun 18 6:31 AM
2024 Jun 18 11:19 AM
2024 Jun 18 10:34 PM
I never heard about taco 🌮 in Spain 🇪🇸
...but maybe it is similar to kebab 🥙 in Poland 🇵🇱 these days.
🤓
2024 Jun 19 10:07 AM
I would say the answer completely matches reality, as taco is very common in Spain. In any restaurant, you will be offered tacos at every corner.
In Spain, "taco" refers to a small, thick slice of food, such as a slice of ham, cheese, or other types of cured meats. These are commonly served as tapas, which are small dishes or snacks.