Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Support
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
A
Automate CV Processing Code
Project overview
Project overview
Details
Activity
Releases
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Issues
0
Issues
0
List
Boards
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Analytics
Analytics
CI / CD
Repository
Value Stream
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
2022-298
Automate CV Processing Code
Commits
74bc48f7
Commit
74bc48f7
authored
May 14, 2022
by
Chamila Dilshan
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
Code 4
parent
df9a97b9
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
30 additions
and
0 deletions
+30
-0
Code 4
Code 4
+30
-0
No files found.
Code 4
0 → 100644
View file @
74bc48f7
plt.figure(figsize=(20,15))
plt.xticks(rotation=90)
ax=sns.countplot(x="Category", data=df)
plt.grid()
cat = df['Category'].value_counts().reset_index()
plt.figure(figsize=(20,12))
plt.pie(cat['Category'], labels=cat['index'], autopct='%.2f%%')
plt.title('Category Distribution')
plt.show()
df['Resume']
0 Skills * Programming Languages: Python (pandas...
1 Education Details \r\nMay 2013 to May 2017 B.E...
2 Areas of Interest Deep Learning, Control Syste...
3 Skills ⢠R ⢠Python ⢠SAP HANA ⢠Table...
4 Education Details \r\n MCA YMCAUST, Faridab...
...
957 Computer Skills: ⢠Proficient in MS office (...
958 â Willingness to accept the challenges. â ...
959 PERSONAL SKILLS ⢠Quick learner, ⢠Eagerne...
960 COMPUTER SKILLS & SOFTWARE KNOWLEDGE MS-Power ...
961 Skill Set OS Windows XP/7/8/8.1/10 Database MY...
Name: Resume, Length: 962, dtype: object
Basic Data Cleaning
remove blank empty lines, remove newlines \r\n
df['Resume'] = df['Resume'].apply(lambda x: re.sub(r'\n{2,}', "", x)) # remove blank empty lines
df['Resume'] = df['Resume'].apply(lambda x: re.sub(r'\r\n', "", x)) # remove newlines \r\n
df['Resume']
\ No newline at end of file
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment