Meeting 20111214

From Archivematica
Jump to navigation Jump to search

Development[edit]

  • Mike got the Archivematica rights admin roughed out (except for selecting rights holders which I'll add in the near future).
  • Joseph was doing a small amount of source cleanup, and getting the license updated
  • Joseph some work on Issue 809: Ensure processing is done in the processing directory
  • Joseph spent some time on this, and think I understand it a bit better: Issue 520: Multi-track audio fails to normalize correctly
  • Joseph did Issue 834: Virus check fails when filename contains accented characters
  • Still having problems with Issue 835:Generate METS fails when non-ascii characters included in metadata
    • This is due to the default interface for mysql being latin
    • A lot of python functions won't accept utf strings
  • Jesus has been working on the uploadDIP, he managed to parse rsync output so he can know more about the file's transfer status

Deployment[edit]

  • CVA has 6 AIPs in archival storage, and counting!
  • Evelyn is going back to CVA on Tuesday, we'll be running VANOC SIPs through then, so they'll need a remote update there, say on Tuesday morning?

Testing[edit]

  • Evelyn is doing the usual testing: filing bugs and reviewing fixes

Documentation[edit]

Chat log[edit]

(10:31:03 AM) epmclellan: meeting time?
(10:31:59 AM) epmclellan: I can take notes
(10:32:27 AM) epmclellan: dev news?
(10:32:44 AM) djjuhasz: mcantelon: now you go!
(10:32:58 AM) mcantelon: Haha... I've got the Archivematica rights admin roughed out (except for selecting rights holders which I'll add in the near future).
(10:32:59 AM) berwin221: I was doing a small amount of source cleanup, and getting the license updated
(10:33:08 AM) epmclellan: mcantelon: yaay!
(10:33:54 AM) berwin221: some work on Issue 809:	Ensure processing is done in the processing directory
(10:34:27 AM) djjuhasz: Awesome mcantelon - adding features in your first 2 weeks on the job, must be some sort of precedent :)
(10:34:29 AM) berwin221: I spent some time on this, and think I understand it a bit better: Issue 520: Multi-track audio fails to normalize correctly
berwin22 berwin221 
(10:34:52 AM) epmclellan: berwin221 I was really glad you tackled that
(10:35:04 AM) epmclellan: I tested it yesterday with several different videos, looks good
(10:35:40 AM) berwin221: did this  Issue 834 :      Virus check fails when filename contains accented characters
(10:36:11 AM) epmclellan: I tested that and it is fixed
(10:36:53 AM) epmclellan: Dec 23 is code freeze, I'm going to go over the issues list again today
(10:37:01 AM) epmclellan: try to separate the must haves from the nice to haves
(10:37:12 AM) mcantelon: Cool.
(10:37:19 AM) Sevein: thank you epmclellan 
(10:37:21 AM) epmclellan: np
(10:37:25 AM) epmclellan: thanks for all the dev
(10:37:27 AM) berwin221: We're still having problems with Issue 835:Generate METS fails when non-ascii characters included in metadata
(10:37:47 AM) epmclellan: this is based on our METS file being ascii, yes?
(10:37:56 AM) epmclellan: instead of UTF-8
(10:38:06 AM) berwin221: no
(10:38:21 AM) berwin221: it's based on the default interface for mysql being latin
(10:38:28 AM) epmclellan: oh
(10:38:32 AM) berwin221: well the latin character set
(10:38:36 AM) epmclellan: yes
(10:38:46 AM) Sevein: I've suggested a patch
(10:38:52 AM) Sevein: but it breaks Archivematica
(10:38:56 AM) berwin221: jesus found a way to switch it to utf-8
(10:38:57 AM) epmclellan: hmm
(10:39:09 AM) berwin221: but a lot of python functions won't accept utf strings
(10:39:22 AM) Sevein: I updated the issue with some details, hope berwin221 can take a look
(10:39:33 AM) berwin221: then please assign the issue to me
(10:40:00 AM) berwin221: If you are owner, I'm assuming you're working on it still :)
(10:40:13 AM) djjuhasz: python y u no like UTF
(10:40:13 AM) djjuhasz: ?
(10:40:20 AM) ***mcantelon haz sadface
(10:40:22 AM) berwin221: some functions...
(10:40:26 AM) epmclellan: me too
(10:40:35 AM) epmclellan: sad face, I meant
(10:40:38 AM) epmclellan: I like UTF-8
(10:40:42 AM) berwin221: python was basically ascii and unicode was tagged on
(10:40:43 AM) mcantelon: Hehe.
(10:41:12 AM) djjuhasz: berwin221: I think *every* scripting language is ASCII with Unicode tagged on :(
(10:41:18 AM) djjuhasz: PHP sure is
(10:41:42 AM) Sevein: ascii? I guess you mean iso-8859-1
(10:42:03 AM) berwin221: yeah... it's frustrating that some functions will take a unicode string, and some won't
(10:42:06 AM) djjuhasz: ASCII was what we called it before the ISO got all up ins
(10:42:19 AM) Sevein: berwin221: so if I add a comment to an issue and you are not the owner, do you read it?
(10:42:21 AM) berwin221: we digress
(10:42:47 AM) berwin221: Sevein, not always.
(10:42:51 AM) Sevein: ok
(10:43:20 AM) berwin221: I forgot to star myself on that issue
(10:44:50 AM) epmclellan: any more dev news?
(10:45:07 AM) epmclellan: deployment
(10:45:18 AM) epmclellan: CVA has 6 AIPs in archival storage, and counting!
(10:45:32 AM) berwin221: other small dev stuff, like Issue 812:	Restructuring a folder in the transfer or SIP creation directories
(10:45:32 AM) berwin221: nothing major
(10:45:44 AM) berwin221: using what revision?
(10:46:04 AM) epmclellan: Austin updated on Friday, not sure what revision it is
(10:46:11 AM) berwin221: ARTi?
(10:46:31 AM) epmclellan: the AIPs are all the same, ca. 1000 TIFF images, no preservation normalization
(10:46:49 AM) Sevein: (dev) I have been working in the uploadDIP, I managed to parse rsync output so I can know more about the files transfer status
(10:47:06 AM) epmclellan: all filenames formatted: eg A000012 etc
(10:48:17 AM) epmclellan: DIPs are being generated for the AIPs at CVA, but they won't be uploaded until CVA's data migration is done
(10:48:43 AM) berwin221: It would be good to know which revision they are running, so we know which fixes were applied. there were multiple commits friday
(10:48:45 AM) ARTi: berwin221: not sure, Sevein can you check remotely?
(10:48:53 AM) Sevein: yes sure
(10:49:00 AM) Sevein: (the access to CVA is documented, btw)
(10:49:29 AM) ARTi: oh, cheers ;]
(10:49:30 AM) berwin221: I guess we should check that it's the same revision on the clients then?
(10:50:07 AM) epmclellan: I'm going back to CVA on Tuesday, we'll be running VANOC SIPs through then
(10:50:18 AM) epmclellan: so we'll need a remote update there, say on Tuesday morning?
(10:50:37 AM) ARTi: berwin221: I think I noticed a change as I was running the svn check out on all the machines, pretty sure I got them on the same revision
(10:50:50 AM) berwin221: k
(10:51:15 AM) Sevein: r2003
(10:51:48 AM) berwin221: so they don't have the fix for 
(10:51:49 AM) berwin221: Issue 814:	Normalization event and normalized object being assigned th
(10:52:03 AM) epmclellan: doesn't matter for the AIPs they have right now
(10:52:10 AM) epmclellan: no preservation normalization taking place
(10:52:22 AM) epmclellan: I made sure that the AIP structure wasn't affected by any outstanding issues
(10:53:06 AM) berwin221: can you elaborate epmclellan?
(10:53:24 AM) berwin221: oh, I think I understand now
(10:53:35 AM) epmclellan: the AIPs are production-ready because they are quite simple
(10:53:41 AM) epmclellan: no preservation normalization etc
(10:53:53 AM) epmclellan: so they wouldn't be affected by pending fixes
(10:54:12 AM) epmclellan: but VANOC SIPs will be more complex
(10:54:21 AM) epmclellan: so we need to do updates before we start those
(10:54:32 AM) epmclellan: and as usual we have a crushing deadline :)
(10:54:52 AM) epmclellan: that's why I'm going there on Tuesday
(10:55:37 AM) epmclellan: on to testing?
(10:56:04 AM) epmclellan: lots of testing, including at CVA
(10:56:20 AM) epmclellan: just the usual, filing bugs and reviewing fixes
(10:56:50 AM) epmclellan: any documentation news?
(10:56:54 AM) berwin221: Do you happen to recall what the AIP/DIP sizes were at COV?
(10:57:04 AM) epmclellan: I have all the data
(10:57:11 AM) berwin221: :)
(10:57:17 AM) epmclellan: I'm going to plug it into the scalability testing page
(10:57:24 AM) epmclellan: I set compression at 5
(10:57:28 AM) berwin221: cool, th anks
(10:57:32 AM) epmclellan: np
(10:57:36 AM) djjuhasz: epmclellan has all the data in her hands
(10:57:41 AM) mcantelon: Awesome!
(10:57:49 AM) epmclellan: the whole data in my hands!
(10:57:56 AM) epmclellan: or in my email, anyway
(10:58:04 AM) djjuhasz: the little bitty data in her hands
(10:58:17 AM) epmclellan: heh
(10:58:18 AM) djjuhasz: emals
(10:58:21 AM) djjuhasz: emails
(10:58:56 AM) epmclellan: think this meeting is done, yes?
(10:59:06 AM) berwin221: seconded