
tranngocminhhieu/toeic-600-words-scraped-dataset


TOEIC 600 Words Scraped Dataset

Learning English can be done in many ways, and not everyone wants to rely solely on language learning apps. Some prefer to access vocabulary datasets and integrate them into their own tools, such as Anki.

To support those looking for a structured TOEIC 600 words dataset and to enhance my data scraping skills, I created this project.


🎉 Special Thanks

A huge thank you to the TFLAT team for their dedication to creating high-quality English learning content and applications. This project uses content from the TFLAT Blog to compile and structure the TOEIC vocabulary data.

🛠 Tools Used

  • Programming Language: Python
  • Libraries: requests, beautifulsoup4, re, numpy, pandas

📦 Scraped Data

  • Vocabulary dataset available in Excel and CSV formats.
  • Images categorized by topic.
  • Audio files grouped by topic for better accessibility.

📊 Statistics

  • Total words: 615
  • Total topics: 50
  • Min-Max words per topic: 12-13
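Figures like these can be recomputed from the CSV export. The following is a minimal sketch using only the standard library; the column names `topic`, `word`, and `type` and the sample rows are assumptions for illustration, not the dataset's confirmed schema.

```python
import csv
import io
from collections import Counter

# Hypothetical sample rows; the real file and its column names may differ.
SAMPLE_CSV = """topic,word,type
Contracts,abide by,v.
Contracts,agreement,n.
Contracts,assurance,n.
Marketing,attract,v.
Marketing,compare,v.
"""


def summarize(csv_text):
    """Return total words, topic count, and min/max words per topic."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    # Count how many rows (words) belong to each topic.
    per_topic = Counter(row["topic"] for row in rows)
    return {
        "total_words": len(rows),
        "total_topics": len(per_topic),
        "min_per_topic": min(per_topic.values()),
        "max_per_topic": max(per_topic.values()),
    }


stats = summarize(SAMPLE_CSV)
print(stats)
```

To run it against the real export, read the file's text and pass it to `summarize`, adjusting the column names to match the actual header row.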

🏷️ Word Type Distribution

  • Noun (n.): 41.67%
  • Verb (v.): 37.42%
  • Adjective (adj.): 13.24%
  • Adverb (adv.): 6.37%
  • Noun, Verb (n, v.): 0.33%
  • Preposition (prep.): 0.16%
  • Verb, Noun (v, n.): 0.16%
  • Phrasal Verb (phr.v.): 0.16%
  • Noun Phrase (n.ph.): 0.16%
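A distribution of this kind is a count of each word-type label divided by the total. Here is a small sketch of that calculation; the `sample_types` list is made up for illustration and is not taken from the dataset.

```python
from collections import Counter


def type_distribution(types):
    """Map each word-type label to its share of the total, in percent."""
    counts = Counter(types)
    total = sum(counts.values())
    # most_common() orders labels from most to least frequent.
    return {label: round(100 * n / total, 2) for label, n in counts.most_common()}


# Hypothetical labels, one per word.
sample_types = ["n.", "n.", "v.", "adj.", "n.", "v.", "adv.", "v."]
distribution = type_distribution(sample_types)

for label, share in distribution.items():
    print(f"{label}: {share}%")
```

Combined labels such as `n, v.` are counted as their own category here, which mirrors how the list above reports them separately.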

🚀 How This Helps

This dataset is perfect for learners who want to:

  • ✅ Build their own study materials using tools like Anki.
  • ✅ Explore TOEIC vocabulary in an organized and structured way.
  • ✅ Access images and audio for better memorization.
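As one possible way to build Anki material, the rows can be written to a tab-separated text file, which Anki can import as notes. This is a sketch under assumptions: the keys `word`, `type`, `meaning`, and `audio` and the sample row are hypothetical, and the audio files would need to be copied into Anki's media folder for the `[sound:...]` tag to play.

```python
import csv
import io


def to_anki_tsv(rows):
    """Build tab-separated text with two fields per note: front and back."""
    out = io.StringIO()
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    for row in rows:
        front = row["word"]
        # Anki plays audio referenced as [sound:filename] on the card.
        back = f'{row["type"]} {row["meaning"]}<br>[sound:{row["audio"]}]'
        writer.writerow([front, back])
    return out.getvalue()


# Hypothetical row; the real dataset's fields may be named differently.
rows = [
    {
        "word": "agreement",
        "type": "n.",
        "meaning": "a mutual arrangement",
        "audio": "agreement.mp3",
    }
]
tsv = to_anki_tsv(rows)
print(tsv, end="")
```

Saving `tsv` to a `.txt` file and enabling HTML in fields during import keeps the `<br>` line break intact.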

If this project helps you, feel free to share it with others who might benefit! 😊

About

Provides learners with ready-to-import datasets for flashcard apps like Anki and Quizlet, including vocabulary, IPA, examples, images, and audio.
