Ai Jobs
January 26, 2025 at 03:45 AM
*Data science interview questions:*
I interviewed at Google a couple of times and was rejected each time, but it was a good learning experience. While preparing, I learned many new things that helped me a lot in my career. This particular interview was about a year ago.
*Designation: Data Scientist*
*Coding Questions:* (they asked many questions; I can't list them all because of space constraints)
1. Given the database schema:
Children(child_name STR, age INT)
Toys(toy_name STR, price DEC)
List(child_name STR, toy_name STR)
Sample values:
Children
Child_Name Age
--------------
Maria 5
Teo 10
Kate 8
Mark 6
Toys
Toy_Name Price
----------------
PS3 200
Skate 50
Scooter 100
Lego 50
Nerf 30
List
Child_Name Toy_Name
-------------------
Kate PS3
Teo PS3
Teo Skate
Kate Scooter
Maria Lego
Kate Nerf
Teo Nerf
Maria Skate
• List (in alphabetical order) the names of the toys requested by children under 9 years old.
o Follow up: How can you make the results unique/distinct?
• Who are the children spending over 200?
• What is the total amount spent by each child?
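A rough sketch of how these three questions could be answered, not the expected interview answer. It assumes SQLite run from Python, TEXT/INTEGER/NUMERIC column types in place of STR/INT/DEC, and that a child who asked for nothing should show a total of 0 (hence the LEFT JOIN). The variable names are my own.

```python
import sqlite3

# In-memory database loaded with the sample values from the question.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Children (child_name TEXT, age INTEGER);
CREATE TABLE Toys (toy_name TEXT, price NUMERIC);
CREATE TABLE "List" (child_name TEXT, toy_name TEXT);
""")
conn.executemany("INSERT INTO Children VALUES (?, ?)",
                 [("Maria", 5), ("Teo", 10), ("Kate", 8), ("Mark", 6)])
conn.executemany("INSERT INTO Toys VALUES (?, ?)",
                 [("PS3", 200), ("Skate", 50), ("Scooter", 100),
                  ("Lego", 50), ("Nerf", 30)])
conn.executemany('INSERT INTO "List" VALUES (?, ?)',
                 [("Kate", "PS3"), ("Teo", "PS3"), ("Teo", "Skate"),
                  ("Kate", "Scooter"), ("Maria", "Lego"), ("Kate", "Nerf"),
                  ("Teo", "Nerf"), ("Maria", "Skate")])

# Toys asked for by children under 9, alphabetical.
# DISTINCT covers the follow-up about unique results.
toys_under_9 = [row[0] for row in conn.execute("""
    SELECT DISTINCT l.toy_name
    FROM "List" AS l
    JOIN Children AS c ON c.child_name = l.child_name
    WHERE c.age < 9
    ORDER BY l.toy_name
""")]

# Total amount per child. LEFT JOIN keeps children with no list entries.
totals = dict(conn.execute("""
    SELECT c.child_name, COALESCE(SUM(t.price), 0) AS total
    FROM Children AS c
    LEFT JOIN "List" AS l ON l.child_name = c.child_name
    LEFT JOIN Toys AS t ON t.toy_name = l.toy_name
    GROUP BY c.child_name
""").fetchall())

# Children whose total is over 200: filter on the aggregate with HAVING.
big_spenders = [row[0] for row in conn.execute("""
    SELECT l.child_name
    FROM "List" AS l
    JOIN Toys AS t ON t.toy_name = l.toy_name
    GROUP BY l.child_name
    HAVING SUM(t.price) > 200
    ORDER BY l.child_name
""")]

print(toys_under_9)
print(totals)
print(big_spenders)
```

On the sample values, Kate and Teo come out as the children spending over 200, and Mark has a total of 0 because he has no rows in List.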
*Case Study Questions:*
1. Imagine the following scenario: out of 1M videos, 50K are bad.
There are two models, Model A and Model B.
Model A flags 75K videos as bad: 50K are actually bad, and 25K are good but marked bad.
Model B flags 25K videos as bad: all 25K are actually bad, but it misses the other 25K bad videos.
Which one would you choose, and why?
What is the difference between weighted recall and F1-score? When would you use ROC-AUC?
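To compare the two models, it helps to write the numbers out as precision, recall, and F1. A minimal sketch using the counts from the scenario; the function name `summarize` is my own, and which metric matters most still depends on the cost of a missed bad video versus a wrongly flagged good one.

```python
def summarize(tp, fp, fn):
    """Precision, recall, and F1 from raw confusion-matrix counts."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return {"precision": precision, "recall": recall, "f1": f1}

# Model A: flags 75K, of which 50K are truly bad and 25K are good; misses none.
model_a = summarize(tp=50_000, fp=25_000, fn=0)

# Model B: flags 25K, all truly bad; misses the other 25K bad videos.
model_b = summarize(tp=25_000, fp=0, fn=25_000)

for name, metrics in (("A", model_a), ("B", model_b)):
    print(name, {k: round(v, 3) for k, v in metrics.items()})
```

Model A has recall 1.0 and precision about 0.667 (F1 = 0.8); Model B has precision 1.0 and recall 0.5 (F1 about 0.667). So A is the better pick if missing bad videos is costly, and B if wrongly removing good videos is costly.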
2. Tell me about the different kinds of fraud that happen in your favorite Google product (I spoke about GPay).
3. Can you suggest some methods to mitigate these frauds? (Data science + simple UI improvements / popups / awareness methods)
5. Suppose you are working as a data scientist at Meta and you see that some users are reporting genuine Instagram profiles as spam/fraud. This wastes a lot of energy, time, and money on inspection. How will you identify the accounts that report genuine profiles as spam, using
a. an ML approach b. a non-ML approach?
Describe your strategy.
*Gen AI questions :*
1. How will you solve GPay frauds with the help of a Gen AI solution? (Take any approach and problem statement of your choice and elaborate.)
2. How will you improve a RAG pipeline? What is HyDE?
3. How will you tackle hallucination?
4. Can you tell me about infinite context window models?
*Statistics Questions:*
1. What is the CLT (central limit theorem)?
2. We need to draw a sample from a class of boys and girls who live in different zip codes and have different test scores. How will you draw a representative sample from this population?
3. If your answer is stratification, how will you choose the weights? (Apart from # events / # total)
4. Can you explain the chi-square goodness-of-fit test?
5. What will the distribution of GPay transactions in India look like? (Need to explain: weekdays, weekends, holiday seasons)
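For the chi-square goodness-of-fit question, a small worked sketch may help. It tests whether transaction counts are spread evenly across the seven days of the week. The daily counts below are made up purely for illustration (they are not real GPay data), and the uniform null hypothesis is my own assumption.

```python
# Hypothetical daily transaction counts, Monday to Sunday (illustrative only).
observed = [130, 125, 128, 132, 140, 170, 175]

# Null hypothesis: transactions are uniform across the 7 days.
total = sum(observed)
expected = [total / len(observed)] * len(observed)

# Chi-square statistic: sum over categories of (O - E)^2 / E.
chi_square = sum((o - e) ** 2 / e for o, e in zip(observed, expected))

# Degrees of freedom = categories - 1 = 6.
# The 5% critical value of the chi-square distribution with 6 df is 12.592.
degrees_of_freedom = len(observed) - 1
critical_value_5pct = 12.592
reject_uniform = chi_square > critical_value_5pct

print(round(chi_square, 3), degrees_of_freedom, reject_uniform)
```

With these made-up counts the statistic is about 18.2, which is above 12.592, so the uniform hypothesis would be rejected at the 5% level. That fits the intuition that weekends look different from weekdays.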
*Hope it helps!* ❤️