Skip to content

Commit e01fa95

Browse files
committed
movie data
1 parent d4a1506 commit e01fa95

3 files changed

Lines changed: 8 additions & 1 deletion

File tree

.gitignore

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,6 @@
11
.ipynb_checkpoints
22
.DS_Store
3+
code/datasets/movie/aclImdb_v1.tar.gz
34

45
# Byte-compiled / optimized / DLL files
56
__pycache__/

code/datasets/README.md

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -15,4 +15,10 @@ Sebastian Raschka, 2015
1515
### wdbc
1616

1717
- used in chapter 6
18-
- source: [https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+(Diagnostic)](https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+(Diagnostic))
18+
- source: [https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+(Diagnostic)](https://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+(Diagnostic))
19+
20+
### movie
21+
22+
- used in chapter 8
23+
- movie dataset converted into a 2-column CSV format: The first column (`review`) contains the text, and the second column (`sentiment`) denotes the polarity, where 0=negative and 1=positive. The first 25,000 are the training samples and the remaining 25,000 rows are the test samples from the "Large Movie Review Dataset v1.0," respectively.
24+
- source: [http://ai.stanford.edu/~amaas/data/sentiment/](http://ai.stanford.edu/~amaas/data/sentiment/)
25.3 MB
Binary file not shown.

0 commit comments

Comments
 (0)