MIGRATED TO ONEDRIVE: WorldCat Data Sync workflow (Current)

Sending files

  • In Alma, go to Admin > Manage Sets
  • Find the logical, public set "WorldCat has catdate AND has inventory AND unsuppressed (All)"
  • Filter using WorldCat batch processing filter (TEU no WCAT) and name the resulting set so that it can be easily identified
  • Go to Admin > Run a Job > Export Bibliographic Records
  • Find the resulting, filtered file and export it
    • Physical format: Binary; all other settings remain at default
  • Download and open in MarcEdit. Be sure UTF8 is selected under Default Character Encoding when using MarcBreaker.
  • Remove records likely in the revision queue.
    • File>>Select records for edit
    • 997$a>>Import file
    • Use regex + YYYYMMD[D-1].|YYYYMMDD.
    • Invert selections
    • Export selected

  • [HT: skip this step] - Remove records with existing 079s. These should be processed separately.
    • File>>Select records for edit
    • 079>>Import file
    • Search for: Display field not found
    • Export selected
  • Remove duplicate records (LF: not sure why there are duplicates)
    • Tools > Record deduplication
      • Control field: Field/Subfield = 001
      • Dedup Keeping: first
      • Options: Remove duplicate items
  • Run the task OCLCbatch_presend
  • Find duplicate 035s and 997s and edit/clean appropriately
    • Edit > Edit shortcuts > Field edits > Find records containing duplicate tags: 
      • 035$a(OCoLC)
      • 997$c
  • Run validation, fix errors, and save the file
  • Get list of sent records and create new set in Alma
    • Do a "Find all" search for =001[space]
    • Click the copy icon at bottom left to copy the results to your clipboard
    • Paste into Notepad++ then with "Regular expression" selected,
      • On the Replace tab:
        • Find: ^\=001\s\s([0-9]*)\t.*
        • Replace: $1
    • Add MMS ID to top of the file and save it
    • Use this list to create an itemized set in Alma
  • Compile into a MARC file (Funnel icon) using naming convention: 1024500.teu.YYMMDD.mrc
  • Update the tracking spreadsheet 
  • Send file via FTP
  • Host: filex-m1.oclc.org
  • Username: fx_teu
  • Password: cwD14lnd
  • Path: /metacoll/in/bib
  • Admin > Run a job > Records > Records sent to WorldCat - waiting on sent file
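
The "Get list of sent records" and file-naming steps above can also be sketched in Python. This is only an illustration, not part of the documented workflow: it assumes the records have been saved as MarcEdit mnemonic (.mrk) text, and the function names are hypothetical. The pattern is based on the Notepad++ expression above, without the trailing tab and context that only appear in the "Find all" results.

```python
import re
from datetime import date

# Based on the Notepad++ "Find" expression: =001, two whitespace characters, digits.
# Uses + instead of * so that an empty 001 is skipped rather than captured.
FIELD_001 = re.compile(r"^=001\s\s([0-9]+)")


def mms_ids(mrk_text: str) -> list[str]:
    """Return the 001 values found in mnemonic (.mrk) text, in file order."""
    ids = []
    for line in mrk_text.splitlines():
        match = FIELD_001.match(line)
        if match:
            ids.append(match.group(1))
    return ids


def itemized_set_file(ids: list[str]) -> str:
    """Build the upload text: an 'MMS ID' header line, then one ID per line."""
    return "\n".join(["MMS ID", *ids]) + "\n"


def send_filename(sent_on: date) -> str:
    """Apply the naming convention 1024500.teu.YYMMDD.mrc."""
    return f"1024500.teu.{sent_on:%y%m%d}.mrc"
```

For example, `send_filename(date(2024, 3, 8))` gives `1024500.teu.240308.mrc`, and the string returned by `itemized_set_file(...)` can be saved as the text file used to create the itemized set.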

Loading files

  • Holly and Molly receive email updates from OCLC when certain processes are completed. The email that indicates files are ready has the subject: 

    TEU -- The report (Bibliographic Record Processing Summary -- Data Synchronization) you requested for DATE

  • Files are retrieved via FTP (see above) from /metacoll/reports and their names will begin with:
    • metacoll.TEU.datasync.1024500-1024500
  • Download all files and open the .mrc file in MarcEdit. Be sure UTF8 is selected under Default Character Encoding when using MarcBreaker.
  • Run the task OCLCbatch
  • Compile into a MARC file, but add "LOAD-" at the beginning of the filename so it doesn't overwrite the original file from OCLC.
  • In Alma, go to Resources > Import and run WorldCat data sync overlay
  • "Select file" and attach the "LOAD-" file from above and then Submit
  • View the report and note any issues
  • Go to "Bibliographic records imported", Total records imported and Create itemized title set
  • Go to Admin > Run a job and select Post-load for WorldCat data sync (TEU)
  • Select the set created above
  • View the report and fix any issues
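
The "LOAD-" naming step above is easy to get wrong by hand, so here is a minimal Python sketch of it. The function name is hypothetical and the sketch only computes the new path; the file itself is still compiled in MarcEdit.

```python
from pathlib import Path


def load_copy_path(original: Path) -> Path:
    """Return a path in the same folder whose filename starts with 'LOAD-'."""
    return original.with_name("LOAD-" + original.name)
```

Because the prefix changes the filename but not the folder, the original file from OCLC is left untouched next to the "LOAD-" copy.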


OCLC holding symbols

Code   Library(s)
TEU    Main, Ambler, Blockson, Harrisburg, Charles, DSS, Media Services, Remote Storage, SCRC
tulaw  Law
TMY    Ginsburg
QBA    Podiatry

Note: Rome and Japan do not have OCLC accounts and their records do not get loaded into WorldCat. Both Law (tulaw) and Health Sciences (HSL=TMY; Podiatry=QBA) have their own OCLC accounts and their holdings are not currently included in this workflow.


TEU search algorithm

Physical titles - WorldCat has catdate AND has inventory AND unsuppressed (All) - OCLC holding libraries only - Logical

Brief Level contains keywords "10" AND

Has inventory equals "Yes" AND

Tag Suppressed (Title) equals "No"

Holding Library = See above

Filter by WorldCat batch processing filter (TEU no WCAT)

rule "997.d.TEU and not exists 997.e.WCAT"
when
    (exists "997.d.TEU") AND
    (not exists "997.e.WCAT")
then
    set indication."true"
end
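
For readers less familiar with indication rules, the logic of the filter above can be restated as a small Python sketch. It is illustrative only and is not how Alma evaluates the rule: the record model (a list of tag and subfield pairs), the exact-match comparison, and the function names are all assumptions, and repeated subfield codes within one field are not modelled.

```python
# A record is modelled as a list of (tag, {subfield_code: value}) pairs.
Field = tuple[str, dict[str, str]]


def has_subfield_value(record: list[Field], tag: str, code: str, value: str) -> bool:
    """True if any field with this tag carries the given subfield value."""
    return any(t == tag and subs.get(code) == value for t, subs in record)


def needs_worldcat_send(record: list[Field]) -> bool:
    """Mirror the rule: 997$d TEU exists AND 997$e WCAT does not exist."""
    return (
        has_subfield_value(record, "997", "d", "TEU")
        and not has_subfield_value(record, "997", "e", "WCAT")
    )
```

A record with 997 $d TEU and no 997 $e WCAT passes the filter; once a 997 $e WCAT is present, it is excluded.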



Processing schedule

2nd and 4th Friday of the month
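
To work out the actual dates for a given month, a short Python sketch using only the standard library is shown here. The function name is hypothetical.

```python
import calendar
from datetime import date


def processing_dates(year: int, month: int) -> list[date]:
    """Return the 2nd and 4th Friday of the given month."""
    days_in_month = calendar.monthrange(year, month)[1]
    fridays = [
        date(year, month, day)
        for day in range(1, days_in_month + 1)
        if date(year, month, day).weekday() == calendar.FRIDAY
    ]
    # Every month has at least four Fridays, so indexes 1 and 3 always exist.
    return [fridays[1], fridays[3]]
```

For March 2024 this returns 8 March and 22 March.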



Helpful Links and Files

Workflow tracking spreadsheet

MarcEdit task - OCLCbatch_presend (download)

MarcEdit task - OCLCbatch (download)

MarcEdit local validation file (download)

Processing files in OwlBox (Leanne's)

Collection Manager login



Data Sync collections

Collection ID  Collection name      Purpose                              Note
1024500        Batchload (P010869)  Manual batchload of Paley+ holdings  Was project P010869
1013302        P012765 (Marcive)    Holdings set by Marcive for govdocs  Was project P012765






...