[BARNA-194] Reading gene sequences close to the chromosome border Created: 16/Jul/12 Updated: 09/Oct/15 Status: Project: Component/s: Affects Version/s: Fix Version/s: Reopened Barna Package Simulator Simulator 1.0.2 (API 1.11) Type: Reporter: Resolution: Labels: Remaining Estimate: Time Spent: Original Estimate: Bug Priority: Micha Sammeth Assignee: Unresolved Votes: git-branch-master, git-branch-sam Not Specified Attachments: Simulator 1.1 (API 1.15) Major Micha Sammeth 1 Not Specified Not Specified reopen.par Description [LIBRARY] Configuration Mode: RH PWM: No RT MIN: 200 RT MAX: 5500 Processing Fragments ********** OK (00:03:57) 34569813 mol: in 34569746, new 67, out 34565024 avg Len 214.94418, maxLen 744 start amplification [INFO] Loading default PCR distribution preparing transcript sequences ******* ERROR [ERROR] Error while preparing sequences: Problems reading sequence Chr3: pos -1, len 100, check whether chromosomal sequence exists / has the correct size java.lang.RuntimeException: Problems reading sequence Chr3: pos -1, len 100, check whether chromosomal sequence exists / has the correct size at barna.model.Graph.readSequence(Graph.java:585) at barna.model.Graph.readSequence(Graph.java:473) at barna.model.Transcript.getSplicedSequence(Transcript.java:459) at barna.flux.simulator.fragmentation.Fragmenter.getMapTxSeq(Fragmenter.java:375) at barna.flux.simulator.fragmentation.Fragmenter.process(Fragmenter.java:539) at barna.flux.simulator.fragmentation.Fragmenter.call(Fragmenter.java:258) at barna.flux.simulator.SimulationPipeline.call(SimulationPipeline.java:424) at barna.flux.simulator.SimulationPipeline.call(SimulationPipeline.java:53) at barna.commons.launcher.Flux.main(Flux.java:212) Caused by: java.lang.NegativeArraySizeException at barna.model.Graph.readSequence(Graph.java:507) at barna.model.Graph.readSequence(Graph.java:473) at barna.model.Graph.readSequence(Graph.java:529) ... 8 more FAILED Comments Comment by Jenkins [ 17/Jul/12 ] Integrated in Barna-All-Nightly-Builds #49 Fix reading across genome border (BARNA-194) (Revision 921f46c66fadea1e92bcfe23de2ae3b6f34c5ba7) Result = SUCCESS micha : 921f46c66fadea1e92bcfe23de2ae3b6f34c5ba7 Files : barna.model/src/test/java/barna/model/splicegraph/GraphTest.java barna.model/src/test/java/barna/model/GraphTest.java barna.model/src/main/java/barna/model/Graph.java barna.model/src/test/java/barna/model/splicegraph/SplicingGraphTest.java Comment by Anonymous [ 19/Jul/12 ] Is there a nightly build I could download with this error fixed? Comment by Micha Sammeth [ 19/Jul/12 ] Yes, the fix should already be included in yesterday's nightly build http://sammeth.net/artifactory/barna-nightly/barna/barna.simulator/1.0.4-20120718010037/ After testing a problematic case that has been provided by some user, we will include the fix in a simulator release. Comment by Anonymous [ 19/Jul/12 ] Hi, I am Arun, the one who posted about this on the flux-wiki forum. I downloaded the nightlybuild and ran it again, it ends up with the same error. But now, it happens after filtering and during PCR amplification stage. I am creating a new post with this and all another error and 2 other questions that have been bothering me for quite a while now in a new post here: http://fluxcapacitor.wikidot.com/forum/t-529477/a-comprehensive-error-report-of-what-i-havefound-so-far. It would be great if you could spare some of your time in answering these questions! Arun. Comment by Micha Sammeth [ 26/Jul/12 ] I am getting the following error with the latest nightly build (I am trying to simulate expression from a reference genome with many contigs, hence many short chromosomes) SEQUENCING] getting the reads Initializing Fragment Index Indexing ********** OK (00:00:03) 2818814 lines indexed (2818814 fragments, 2630 entries) sequencing [WARN] skipped line ACCK01000001.1 mgb2gtf gene 148 1200 . + . transcript_id "CATMIT_00001"; gene_id "CATMIT_00001"; *[ERROR] Error while sequencing : Problems reading BED object ACCK01000003.1 14785 14861 ACCK01000003.1:14115-14861W:CATMIT_00041:5:747:466:746:A/2 0 - . . 0,0,0 1 76 0 java.lang.RuntimeException: Problems reading BED object ACCK01000003.1 14785 14861 ACCK01000003.1:14115-14861W:CATMIT_00041:5:747:466:746:A/2 0 - . . 0,0,0 1 76 0 at barna.model.bed.BEDobject2.readSequence(BEDobject2.java:335) at barna.flux.simulator.Sequencer.createQSeq(Sequencer.java:600) at barna.flux.simulator.Sequencer$SequenceWriter.writeRead(Sequencer.java:936) at barna.flux.simulator.Sequencer$Processor.process(Sequencer.java:771) at barna.flux.simulator.Sequencer.sequence(Sequencer.java:247) at barna.flux.simulator.Sequencer.call(Sequencer.java:120) at barna.flux.simulator.SimulationPipeline.call(SimulationPipeline.java:436) at barna.flux.simulator.SimulationPipeline.call(SimulationPipeline.java:53) at barna.commons.launcher.Flux.main(Flux.java:212) Caused by: java.lang.RuntimeException: Problems reading chromosome ACCK01000003.1 from 14785 to 14780 I managed a work-around wherein I add 100 N nucleotides to the end of each of my 'chromosomes' as buffer and updated the ranges of the GTF file accordingly. Obviously this is not ideal (I would rather it not sequence past the boundary rather than having to add garbage to the simulator which I will have to remove later) Generated at Tue Feb 09 21:15:41 CET 2016 using JIRA 6.4#64014sha1:a67c3532e05d88bdc00cabc2cf5e0c6b82fa6023.