Skip to content

Commit

Permalink
reformat
Browse files Browse the repository at this point in the history
  • Loading branch information
atqy committed May 5, 2022
1 parent 7243dee commit 7687bf1
Showing 1 changed file with 2 additions and 4 deletions.
6 changes: 2 additions & 4 deletions prep_data/text_data/preprocessing_text_data.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -49,7 +49,7 @@
"metadata": {},
"outputs": [],
"source": [
"! python -m pip install --upgrade pip \n",
"! python -m pip install --upgrade pip\n",
"! pip install -U 'sagemaker>=2.15.0' spacy gensim==4.0.0 textblob emot==2.1 autocorrect"
]
},
Expand Down Expand Up @@ -181,9 +181,7 @@
"filename = \"training.1600000.processed.noemoticon.csv\"\n",
"s3.Bucket(bucket).download_file(prefix + \"/\" + filename, filename)\n",
"# we will showcase with a smaller subset of data for demonstration purpose\n",
"text_data = pd.read_csv(\n",
" filename, header=None, encoding=\"ISO-8859-1\", low_memory=False, nrows=10000\n",
")\n",
"text_data = pd.read_csv(filename, header=None, encoding=\"ISO-8859-1\", low_memory=False, nrows=10000)\n",
"text_data.columns = [\"target\", \"tw_id\", \"date\", \"flag\", \"user\", \"text\"]"
]
},
Expand Down

0 comments on commit 7687bf1

Please sign in to comment.