Refactor #49

Merged 52 commits on Nov 29, 2022
Commits
921f086
add tests
dalejn Apr 25, 2022
5ac8bef
added existing fucntions to scripts
jastiso Apr 25, 2022
9470d69
get rid of r files
dalejn Apr 25, 2022
2513e4e
added imports to function scripts
jastiso Apr 25, 2022
a6abde6
Merge branch 'refactor' of https://github.com/dalejn/cleanBib into re…
jastiso Apr 25, 2022
ff38c50
add test aux
dalejn May 2, 2022
22bc19e
breaking first cell into fn units
jastiso May 2, 2022
eec793e
added pipeline and got imports working
jastiso May 9, 2022
65465c9
added needed imports and arguments for 1st cell functions
jastiso May 10, 2022
f1988b9
added function for finding unused citations in aux file
jastiso May 11, 2022
f4d2c9f
automatically removed duplicates
jastiso May 17, 2022
3a0c1eb
updating pipelines
jastiso May 17, 2022
18a7e28
Removed the R code and added code to check the number of credits allo…
murphyka May 24, 2022
09388d0
adding bibcheck to pipeline, defining API functions
jastiso Jun 13, 2022
123d5c4
ipython req
dalejn Jul 15, 2022
85ecff6
pandas req
dalejn Jul 15, 2022
c1d2b4f
scipy req
dalejn Jul 15, 2022
2c33711
update versions
dalejn Jul 15, 2022
100cc9d
update versions
dalejn Jul 15, 2022
f8a3964
update versions
dalejn Jul 15, 2022
91bb1ea
update versions
dalejn Jul 15, 2022
476a5d8
debugging pipeline for clean data
Jul 22, 2022
bbbf4d8
Merge branch 'refactor' of https://github.com/dalejn/cleanBib into re…
Jul 22, 2022
de95a8e
removing duplicate queries
jastiso Jul 22, 2022
0cc0df3
added check for names that have already been queried
jastiso Aug 5, 2022
406665f
made histogram plotting in python
jastiso Aug 5, 2022
7c9fe8b
fixed bug in histogram
jastiso Aug 5, 2022
a9ab4b1
added env file
jastiso Aug 12, 2022
694882c
fixed file paths
jastiso Aug 12, 2022
3753a42
new environment
jastiso Aug 12, 2022
0881e21
move yml
dalejn Aug 12, 2022
450db3f
move yml
dalejn Aug 12, 2022
8173379
trying third env
jastiso Aug 12, 2022
47a96cc
fixing merge
jastiso Aug 12, 2022
cf988f5
working on query reporting
jastiso Aug 12, 2022
28f1350
relax dependencies
dalejn Nov 28, 2022
255a258
Merge branch 'refactor' of https://github.com/dalejn/cleanBib into re…
dalejn Nov 28, 2022
bd52355
relax dependencies
dalejn Nov 28, 2022
28fc386
relax dependencies
dalejn Nov 28, 2022
8561e75
upgrade ethnicolr
dalejn Nov 28, 2022
dc1950f
update versions
dalejn Nov 28, 2022
cfc9643
update python
dalejn Nov 28, 2022
7d6205c
rm environment.yml
dalejn Nov 28, 2022
bd898d0
rm r
dalejn Nov 28, 2022
e727ae5
update python
dalejn Nov 28, 2022
340b50b
rm tf-base
dalejn Nov 28, 2022
be2bfa7
relax dependencies
dalejn Nov 28, 2022
8944c1f
clean up
dalejn Nov 28, 2022
b2956e9
update for tests
dalejn Nov 29, 2022
14c9dc7
update README
dalejn Nov 29, 2022
d6f2b8b
update launch binder link
dalejn Nov 29, 2022
4f23402
prepare for merge
dalejn Nov 29, 2022
1 change: 1 addition & 0 deletions .gitignore
@@ -1,2 +1,3 @@
.DS_Store
.ipynb_checkpoints/*
env/*
13 changes: 12 additions & 1 deletion README.md
@@ -59,6 +59,7 @@ And editorials and research highlights of this work:
* Z. Budrikis (2020). Growing citation gender gap. *Nature Reviews Physics*. [doi: https://doi.org/10.1038/s42254-020-0207-3](https://doi.org/10.1038/s42254-020-0207-3)
* D. J. Sweet (2021). New at cell press: the inclusion and diversity statement. *Cell*, 184(1), 1-2. [doi: https://doi.org/10.1016/j.cell.2020.12.019](https://www.sciencedirect.com/science/article/pii/S0092867420316895?via%3Dihub)
* B. Rowson, S.M. Duma, M.R. King, I. Efimov, A. Saterbak, and N.C. Chesler (2021). Citation diversity statement in BMES journals. *Annals of Biomedical Engineering*, 1-3. [doi: https://doi.org/10.1007/s10439-021-02739-6](https://link.springer.com/article/10.1007/s10439-021-02739-6)
* D. Kwon (2022). The rise of citational justice: how scholars are making references fairer. *Nature*, 603(7902), 568-571. [doi: https://doi.org/10.1038/d41586-022-00793-1](https://www.nature.com/articles/d41586-022-00793-1)

For `.pdf` and `.tex` templates of the statement, see the `/diversityStatement` directory in this repository.

@@ -118,7 +119,7 @@ ___

2. Launch the coding environment. Please refresh the page if the Binder does not load after 5-10 mins.

[![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/dalejn/cleanBib/34b3a896b6fe0961b2dfc3ad22214cf45da48cca?urlpath=/tree/)
[![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/dalejn/cleanBib/refactor?urlpath=/tree/)

3. Open the notebook `cleanBib.ipynb`. Follow the instructions above each code block. It can take 10 minutes to 1 hour to complete all of the instructions, depending on the state and size of your `.bib` file. We expect that the most time-consuming step will be manually modifying the `.bib` file to find missing author names, fill incomplete entries, and fix formatting errors. These problems arise because the automated methods of reference managers and Google Scholar sometimes cannot retrieve full information, for example when a journal provides only an author's first initial instead of their full first name.

@@ -333,6 +334,7 @@ ___
* Christopher Camp
* Eli Cornblath
* Jordan Dworkin
* Kieran Murphy
* Jordan Matelsky
* Cleanthis Michael
* Kendra Oudyk
@@ -344,6 +346,15 @@ ___
* Dale Zhou

# Changelog
* __11/28/2022__
* major refactor (thanks, Jeni!)
* removed SOS notebook
* upgraded all packages and libraries
* all R code now in Python
* majority of code now calls functions located in utils/
* automate removal of identified duplicates and self-citations
* query how many credits left and save some by only querying unique names (thanks, Kieran!)
* fixes issue introduced by [protobuf upgrade](https://github.com/protocolbuffers/protobuf/issues/10051)

* __9/14/2021__
* force Binder to load with [classic theme](https://discourse.jupyter.org/t/mybinder-org-using-jupyterlab-by-default/10715) because new default breaks SOS notebook R code
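
The changelog item about saving credits by querying only unique names, together with the commit "added check for names that have already been queried", can be sketched roughly as follows. This is an illustrative sketch only, not the code in `utils/`: `unique_names`, `query_with_cache`, and the `lookup` callable are hypothetical names standing in for whatever the repository uses to call its name-lookup API.

```python
def unique_names(names):
    """Return names in first-seen order, deduplicated case-insensitively."""
    seen = set()
    unique = []
    for name in names:
        cleaned = name.strip()
        key = cleaned.lower()
        if key and key not in seen:
            seen.add(key)
            unique.append(cleaned)
    return unique


def query_with_cache(names, lookup, cache=None):
    """Look up each distinct name once and reuse the answer for repeats.

    `lookup` is a hypothetical callable (name -> result) that would wrap
    the paid API call; `cache` can carry results over from earlier runs.
    Returns the per-name results and the number of lookups actually made.
    """
    cache = {} if cache is None else cache
    calls = 0
    for name in unique_names(names):
        key = name.lower()
        if key not in cache:  # skip names that were already queried
            cache[key] = lookup(name)
            calls += 1
    results = [cache[n.strip().lower()] for n in names if n.strip()]
    return results, calls


if __name__ == "__main__":
    # Stand-in lookup that just upper-cases the name instead of calling an API.
    results, calls = query_with_cache(["Ana", "ana ", "Bo", "Ana"], str.upper)
    print(results, calls)
```

Passing the same `cache` dictionary across runs would let a later run skip names queried earlier, which is the credit-saving idea the changelog describes.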
Empty file added __init__.py
Empty file.
916 changes: 66 additions & 850 deletions cleanBib.ipynb

Large diffs are not rendered by default.

Binary file removed diversityStatement/.DS_Store
Binary file not shown.
28 changes: 0 additions & 28 deletions environment.yaml

This file was deleted.

2 changes: 0 additions & 2 deletions install.R

This file was deleted.

1 change: 0 additions & 1 deletion postBuild

This file was deleted.

120 changes: 120 additions & 0 deletions requirements.txt
@@ -0,0 +1,120 @@
absl-py
appnope
argon2-cffi
argon2-cffi-bindings
asttokens
astunparse
attrs
backcall
beautifulsoup4
bibtexparser==1.3.0
bleach
cachetools
certifi
cffi
charset-normalizer
cycler
debugpy
decorator
defusedxml
entrypoints
ethnicolr==0.9.1
executing
fastjsonschema
flatbuffers
fonttools
folium
future
gast
google-auth
google-auth-oauthlib
google-pasta
grpcio
h5py
habanero==1.2.2
idna
imgaug
importlib-metadata
ipykernel
ipython
jedi
Jinja2
jsonschema
jupyter
jupyter-client
jupyter-console
jupyter-core
jupyterlab-pygments
jupyterlab-widgets
keras==2.8.0
keras-preprocessing==1.1.2
kiwisolver
latexcodec==2.0.1
Markdown
MarkupSafe
matplotlib
matplotlib-inline
mistune
nbclient
nbconvert
nbformat
nest-asyncio
notebook
numpy==1.22.3
oauthlib
opt-einsum
packaging
pandas==1.4.3
pandocfilters
parso
pexpect
pickleshare
Pillow
prometheus-client
prompt-toolkit
protobuf==3.20.1
psutil
ptyprocess
pure-eval
pyasn1
pyasn1-modules
pybtex==0.24.0
pycparser
Pygments
pylatexenc==2.10
pyparsing
pyrsistent
python-dateutil
pytz
PyYAML
pyzmq
qtconsole
QtPy
requests
requests-oauthlib
rsa
scipy==1.7.3
seaborn==0.11.2
Send2Trash
six
soupsieve
stack-data
tensorboard==2.8.0
tensorboard-data-server==0.6.0
tensorboard-plugin-wit==1.8.1
tensorflow==2.8.1
tensorflow-estimator
termcolor
terminado
tinycss2
tornado
tqdm
traitlets
typing-extensions
urllib3
wcwidth
webencodings
Werkzeug
widgetsnbextension
wrapt
zipp
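
Several commits in this PR are titled "relax dependencies", and the resulting `requirements.txt` mixes exact pins (for example `numpy==1.22.3`) with unpinned names. As a rough aid for reviewing which packages remain pinned, a small helper along these lines could be used. It is a sketch under assumptions: `split_requirements` is a hypothetical name, and it only understands the plain `name` and `name==version` forms that appear in this file.

```python
def split_requirements(lines):
    """Split requirement lines into exact pins and unpinned package names.

    Only handles bare names and `name==version` entries; other specifiers
    (>=, extras, URLs) are treated as unpinned and returned unchanged.
    """
    pinned = {}
    unpinned = []
    for raw in lines:
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        if "==" in line:
            name, version = line.split("==", 1)
            pinned[name.strip()] = version.strip()
        else:
            unpinned.append(line)
    return pinned, unpinned


if __name__ == "__main__":
    sample = ["absl-py", "bibtexparser==1.3.0", "", "numpy==1.22.3", "tqdm"]
    pinned, unpinned = split_requirements(sample)
    print("pinned:", pinned)
    print("unpinned:", unpinned)
```

To run it against the real file, the lines could be read with `open("requirements.txt").read().splitlines()`.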
2 changes: 1 addition & 1 deletion runtime.txt
@@ -1 +1 @@
r-3.6-2019-04-12
python-3.10.5
Empty file added tests/__init__.py
Empty file.