
How do you seed your private repository with prior term student submissions?

Many institutions have student submissions from prior terms that they would like to add to their repository in bulk. We call this "seeding" your repository. Once your institutional repository is seeded, you can search against your institution's student submissions from the last few years, even if your institution is new to VeriCite. There is no charge for this service. To seed your repository, follow the steps below.

First, make sure you have permission to index the content.

Check with your institution's legal counsel to ensure that you have permission to use student-owned work in this way. For example, if you used a different plagiarism detection tool in the past and students were notified that their work would be submitted for electronic plagiarism review, they may have already agreed to such use. Your learning management system vendor may also require institutional assurances that you have permission to use this data in this manner.

Gather the submissions you would like to load into VeriCite.

The instructions for gathering files vary depending on where the content is stored. Most institutions get the files from their LMS.

There are no file format restrictions and any folder structure is ok. It is easiest to provide us with zip files, but this isn’t a requirement.

All data should be handled securely.

Note: You may seed your repository with up to 4 years of prior submissions.

Blackboard

Submit a support ticket to Blackboard to request a copy of the student files.

Canvas

We provide scripts for you to gather your submissions through the Canvas API.

1. Get the scripts.

Go to https://github.com/vericite/canvas-utils/releases to get the scripts. Information on how to run each script can be found in this project’s README.

2. Get an authorization token within Canvas.

To manually generate a token:

  1. Go to the Account > Settings link in the left menu bar, or navigate to /profile.
  2. Under the Approved Integrations section, click the button to generate a new access token.
  3. Copy the token as soon as it is generated. You cannot view it again, so if you lose it you will have to generate a new one. Access tokens are password-equivalent, so keep yours secret.
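
Because the token is password-equivalent, one option is to keep it in a file that only you can read and load it into an environment variable rather than typing it into each command. The following is a minimal sketch of that idea. The helper name `load_canvas_token`, the variable `CANVAS_TOKEN`, and the file path are assumptions for illustration and are not part of the VeriCite scripts.

```shell
# Hypothetical helper: load a Canvas access token from a private file into
# the CANVAS_TOKEN environment variable (both names are illustrative).
load_canvas_token() {
  token_file=$1
  if [ ! -r "$token_file" ]; then
    echo "Cannot read token file: $token_file" >&2
    return 1
  fi
  # Remove the trailing newline and any stray whitespace.
  CANVAS_TOKEN=$(tr -d '[:space:]' < "$token_file")
  if [ -z "$CANVAS_TOKEN" ]; then
    echo "Token file is empty: $token_file" >&2
    return 1
  fi
  export CANVAS_TOKEN
}

# Example usage (the path is an assumption):
#   chmod 600 "$HOME/.canvas_token"
#   load_canvas_token "$HOME/.canvas_token"
```

With the variable set, the scripts in the following steps could be given -token="$CANVAS_TOKEN" in place of the literal token.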

3. Run the “list-courses” script to list all of the courses you wish to export.

To run this script, you need the Canvas Account ID and Term ID you wish to export. These IDs appear in the URL when you view the account or term in Canvas (e.g. /accounts/1). You can only run one account and one term at a time. Save the script output into a file named “courses.csv”.

./list-courses -token="9000~aXXXXXXXXXXXXXXXXXXX" -url="https://acmecollege.instructure.com/api/v1/" -accountId=1 -termId=1 > courses.csv
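
Because each run covers a single account and term, exporting several terms means repeating the command. The loop below is a rough sketch of one way to do that. It assumes the token is held in a `CANVAS_TOKEN` environment variable, and it writes one file per account/term pair instead of a single courses.csv, so the next step would be run once per file. The function name, the `ACCOUNT_ID:TERM_ID` argument format, the output file names, and the `LIST_COURSES` override are all illustrative assumptions.

```shell
# Hypothetical wrapper: run list-courses once per ACCOUNT_ID:TERM_ID pair.
# Set LIST_COURSES to override the script path (defaults to ./list-courses).
list_courses_for_pairs() {
  api_url=$1
  shift
  for pair in "$@"; do
    account_id=${pair%%:*}   # text before the colon
    term_id=${pair##*:}      # text after the colon
    out_file="courses-${account_id}-${term_id}.csv"
    "${LIST_COURSES:-./list-courses}" -token="$CANVAS_TOKEN" -url="$api_url" \
      -accountId="$account_id" -termId="$term_id" > "$out_file" || return 1
    echo "Wrote $out_file"
  done
}

# Example usage (account 1, terms 1 and 2):
#   list_courses_for_pairs "https://acmecollege.instructure.com/api/v1/" 1:1 1:2
```

Keeping one file per pair avoids making any assumption about whether the script output can be safely concatenated.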

4. Run the “list-course-assignments” script.

Point the input parameter “filename” to your “courses.csv” file. Optionally, to restrict the export to assignments that previously had Turnitin enabled, set the turnitin parameter to true. (Note: this only works for the non-LTI integration, and only if Turnitin is still enabled.) Save the script output into a file named “assignments.csv”.

./list-course-assignments -token="9000~aXXXXXXXXXXXXXXXXXXX" -url="https://acmecollege.instructure.com/api/v1/" -filename="courses.csv" > assignments.csv

5. Run the “export-submissions” script.

Point the input parameter “filename” to your “assignments.csv” file. This will download all attachments for the listed assignments into a folder of your choosing (default is “submissions”). The download can take a long time, so run the script in a session that will not be interrupted.

./export-submissions -token="9000~aXXXXXXXXXXXXXXXXXXX" -url="https://acmecollege.instructure.com/api/v1/" -filename="assignments.csv" -outputFolder="submissions"
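
Since the export can run for a long time, one common approach (not specific to these scripts) is to start it with `nohup` so that it keeps running if your terminal session closes, and to capture its output in a log file. A minimal sketch follows; the helper name `run_detached` and the log file name are assumptions.

```shell
# Hypothetical helper: start a command in the background, immune to hangups,
# with stdout and stderr captured in a log file. Prints the process ID.
run_detached() {
  log_file=$1
  shift
  nohup "$@" > "$log_file" 2>&1 &
  echo "$!"
}

# Example usage:
#   run_detached export.log ./export-submissions \
#     -token="9000~aXXXXXXXXXXXXXXXXXXX" \
#     -url="https://acmecollege.instructure.com/api/v1/" \
#     -filename="assignments.csv" -outputFolder="submissions"
#   tail -f export.log   # follow the log while the export runs
```

A terminal multiplexer such as screen or tmux would serve the same purpose if you prefer to keep the session attached.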

Tip: You can run all scripts at once with a single command like:

./list-courses -token="9000~aXXXXXXXXXXXXXXXXXXX" -url="https://acmecollege.instructure.com/api/v1/" -accountId=1 -termId=1 | tee courses.csv && \
./list-course-assignments -token="9000~aXXXXXXXXXXXXXXXXXXX" -url="https://acmecollege.instructure.com/api/v1/" -filename="courses.csv" | tee assignments.csv && \
./export-submissions -token="9000~aXXXXXXXXXXXXXXXXXXX" -url="https://acmecollege.instructure.com/api/v1/" -filename="assignments.csv" -outputFolder="submissions"

D2L/Brightspace

Submit a support ticket to D2L to request a copy of the student files.

Moodle

Your Moodle system administrator or hosting partner can export the data for you.

Sakai

Your Sakai system administrator or hosting partner can export the data from the CONTENT_RESOURCE table using the following query.

SELECT substring_index(RESOURCE_ID,'/',-1) as filename, FILE_PATH as path FROM CONTENT_RESOURCE

Or, to restrict the export to files that were previously submitted for plagiarism review:

SELECT substring_index(cr.RESOURCE_ID,'/',-1) as filename, cr.FILE_PATH as path
FROM CONTENT_RESOURCE cr
RIGHT JOIN CONTENTREVIEW_ITEM ci ON cr.RESOURCE_ID = ci.contentId

Submit a support ticket to VeriCite.

Once you have collected all of the submissions you wish to add to your private repository, submit a VeriCite support ticket to request the seeding of your repository.

Securely transfer the files.

Upon receipt of your request, we will provide you with upload access information for a secure SFTP location. If you prefer to upload via a GUI, we recommend Cyberduck.

Remember, there are no file format restrictions and any folder structure is ok. It is easiest to provide us with zip files, but this isn’t a requirement.
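
If you prefer the command line to a GUI, the packaging and upload could look roughly like the sketch below. It zips an export folder and writes a batch file for the standard `sftp` client. The helper names, file names, and the USER@HOST placeholder are assumptions; use the access details that VeriCite support sends you. Note that `sftp -b` needs non-interactive authentication such as an SSH key.

```shell
# Hypothetical helper: zip an export folder (requires the zip utility).
make_upload_archive() {
  folder=$1
  archive=$2
  zip -r -q "$archive" "$folder"
}

# Hypothetical helper: write an sftp batch file that uploads each given file.
write_sftp_batch() {
  batch_file=$1
  shift
  : > "$batch_file"
  for upload in "$@"; do
    printf 'put "%s"\n' "$upload" >> "$batch_file"
  done
  printf 'bye\n' >> "$batch_file"
}

# Example usage (USER@HOST is a placeholder):
#   make_upload_archive submissions submissions.zip
#   write_sftp_batch upload.batch submissions.zip
#   sftp -b upload.batch USER@HOST
```

If you were given a password instead of a key, you can run `sftp USER@HOST` interactively and enter `put submissions.zip` at the prompt.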

Please remember to send up to 4 years of prior submissions only.

Once you have completed your file transfer, let VeriCite support know.

Your submissions will be indexed.

We will seed your private repository as soon as possible, and update the support ticket to let you know when indexing is complete. After indexing, we will delete your uploaded files from the SFTP location. Please make sure to delete your exported files as well.
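
Deleting your local copies can be as simple as removing the export folder and the intermediate CSV files. The sketch below wraps that in a small guard; the helper name `cleanup_export` is an assumption, and the example paths are the default names used in the Canvas steps, so adjust them to match what you actually created.

```shell
# Hypothetical helper: delete local export files and folders once indexing
# is confirmed. Refuses to run with no arguments or on "/".
cleanup_export() {
  if [ "$#" -eq 0 ]; then
    echo "Usage: cleanup_export PATH..." >&2
    return 1
  fi
  for target in "$@"; do
    if [ "$target" = "/" ]; then
      echo "Refusing to delete /" >&2
      return 1
    fi
  done
  rm -rf -- "$@"
}

# Example usage:
#   cleanup_export submissions courses.csv assignments.csv
```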