Closed

Split File into Largest Even Multiple Given Number of Records

I need help with a task. Details below.

Given file A with x records, split the file into N smaller files, each containing an equal number of records (y).

Notes:

Records can only be identified by a line starting with "GROUP"

ALL text is surrounded by quotes ("")

All text is delimited by semicolon (;)

All text ends in a new line

All files end in a newline

All quotes and delimiters must remain intact.

Just splitting the file as is, no other changes. Keeping order.

File may contain any number of records - primarily used for files with >10k records

Record lengths vary by number of lines

Not all files will have the same order of body text between records, and files may end with different text; the only marker of a new record is a line beginning with "GROUP"
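
The record rule in the notes above could be sketched in Base R roughly as follows. This is only an illustration: the sample lines are invented (the real file layout is not shown here), and the pattern assumes the marker may be preceded by a double quote and may vary in case ("GROUP" or "Group").

```r
# Hypothetical sample lines; the real column layout is not shown in the post.
lines <- c(
  '"GROUP";"A1"',
  '"body";"text"',
  '"Group";"A2"',
  '"GROUP";"A3"',
  '"body";"more"',
  '"footer";"x"'
)

# TRUE where a line opens a new record.
# Assumption: an optional leading quote, case-insensitive match on GROUP.
is_start <- grepl('^"?GROUP', lines, ignore.case = TRUE)

# A running count of markers gives every line the id of the record it belongs to.
# Lines that appear before the first marker would get id 0.
record_id <- cumsum(is_start)

# Total number of records in the input.
n_records <- sum(is_start)
```

Because `record_id` is a plain vector aligned with `lines`, whole records can then be selected or grouped with ordinary indexing instead of a manual pointer.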

[login to view URL] holds the sample text. It holds 20050 'records' (Lines that start with group - all text after until the next 'Group' is part of the same 'record')

The primary issue I was having with this project was identifying blocks of records in order to manipulate them (see lines 234-238 in [login to view URL]; I tried to use a 'pointer', but really want to use a map function?). My work so far is in [login to view URL] located at [login to view URL]. This is just to show my thought process. Can discuss at end of project.

Input: path to file

Output: N number of files each with y number of records

Deliverables

R Notebook that splits a file with contents following the structure outlined above

Preferably solved with a Tidy solution or Base R solution

Please include comments throughout code

Suggestions on next steps to make distributable

Next steps for me are to make into a shiny app and host on AWS or Azure for users to select their file and receive split files in return

Opportunity for ongoing codementor help

EXAMPLE:

[login to view URL] holds 20050 records. I add the path to the sample text in the R script or Notebook. I enter the number of resulting files I want. The script determines approximately how many records should be in each file; some left over in the last file is okay. If I enter 5 for the number of output files I want, the script should return 5 files, each with 4010 records. If I enter 6 for the number of output files, the script should return 5 files with 3340 records each, and the 6th would hold the remainder (3350 records).
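
The arithmetic in this example can be reproduced with a small helper. The post does not say exactly how 3340 is derived from 20050 / 6 (about 3341.67), so the sketch below assumes the per-file count is rounded down to a multiple of 10; rounding down to an even number gives the same result for both cases. The function name `records_per_file` and the `multiple_of` parameter are made up for illustration.

```r
# Sketch: how many records go into each output file.
# Assumption: y is floor(n_records / n_files) rounded down to a multiple of
# `multiple_of`; the last file takes whatever is left over.
records_per_file <- function(n_records, n_files, multiple_of = 10) {
  stopifnot(n_files >= 1, n_records >= n_files)
  y <- (n_records %/% n_files) %/% multiple_of * multiple_of
  # Fall back to the plain floor when rounding would give zero records per file.
  if (y == 0) y <- n_records %/% n_files
  c(rep(y, n_files - 1), n_records - y * (n_files - 1))
}

records_per_file(20050, 5)  # 4010 4010 4010 4010 4010
records_per_file(20050, 6)  # 3340 3340 3340 3340 3340 3350
```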

Summary and 'pseudocode'

Count number of records

Identify records

Find the number of records per file (y) that splits the records as evenly as possible across the user-defined number of output files

For line in notice_line:

For the number of lines in notice_line

If a line starts with "Group", Create an empty file

Name the empty file File_n , n for line number in notice_line

Put the line in the empty file

If the line is not group

Add the line to the existing file

Until the file has y records, where y is the number of records per file that gives the user-requested number of output files approximately equal size
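
One possible Base R reading of the pseudocode above is sketched below. It is not the finished deliverable: instead of appending line by line, it assigns every line a record id and then a file index in one pass. The function name `split_records`, the `File_n.txt` naming, the `multiple_of` rounding rule, and the marker pattern are all assumptions made for illustration.

```r
# Sketch: split a file into n_files pieces on record boundaries.
split_records <- function(path, n_files, out_dir = dirname(path), multiple_of = 10) {
  # Read the file as raw lines so quotes and semicolons stay untouched.
  lines <- readLines(path, warn = FALSE)

  # Assumption: a record starts on a line beginning with GROUP, optionally
  # preceded by a double quote, in any letter case.
  is_start <- grepl('^"?GROUP', lines, ignore.case = TRUE)
  record_id <- cumsum(is_start)
  n_records <- sum(is_start)
  stopifnot(n_files >= 1, n_records >= n_files)

  # Records per file, rounded down to a multiple of `multiple_of` (assumed rule).
  y <- (n_records %/% n_files) %/% multiple_of * multiple_of
  if (y == 0) y <- n_records %/% n_files

  # File index for each line: records 1..y go to file 1, and so on.
  # Leftover records, and any lines before the first marker, are clamped
  # into the last and first file respectively.
  file_id <- pmin((pmax(record_id, 1) - 1) %/% y + 1, n_files)

  # Group lines by file index (order within each group is preserved) and write.
  chunks <- split(lines, factor(file_id, levels = seq_len(n_files)))
  out_paths <- file.path(out_dir, sprintf("File_%d.txt", seq_len(n_files)))
  for (i in seq_len(n_files)) writeLines(chunks[[i]], out_paths[i])
  invisible(out_paths)
}
```

Calling `split_records("path/to/sample.txt", 6)` would, under these assumptions, write `File_1.txt` through `File_6.txt` next to the input; the path is a placeholder. Reading the whole file into memory is a deliberate simplification and may need revisiting for very large inputs.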


Skills: Excel, Data Entry, Data Processing, Data Search, Web Search


About the employer:
(1 review) Băilești, Romania

Project ID: #27434677

11 freelancers are bidding on average $153 for this job

schoudhary1553

Hi, Greetings! ✅checked your project details: Split File into Largest Even Multiple Given Number of Records ✅Completed Time: In project deadline We have worked on 600 + Projects. I have 6 + years of the experien

$220 USD in 4 days
(404 reviews)
8.2
writingapp

Hi. I have written many automation apps for formatting files. Can format to any templates. Ready to write your project. Will complete within 3 days

$90 USD in 3 days
(64 reviews)
5.4
Valuesolutions

Hello, please contact me for more details.... .....................................................

$140 USD in 7 days
(10 reviews)
4.6
Banze

I can help you in Split File into Largest Even Multiple Given Number of Records. I have read and understood all your initial requirements, and I feel,I am producing quality data entry for my clients including; Web Re

$200 USD in 7 days
(30 reviews)
4.4
azfatima512

Hi I am 100% sure this project . I am ready to start. I am expert in Microsoft Office specially Excel, Word and Access Database. Data Entry and Processing is my passion with years of experiences. I am talented and very

$30 USD in 1 day
(14 reviews)
3.3
goodwork20k

Hello I am a powershell, perl and shell script expert and did similar split file scripting in my past and i can help you to split the file via any one of scripting Kindly confirm are you comfortable with Powershell o

$200 USD in 7 days
(3 reviews)
3.0
hoorulain7586

I am available right now. Hi there, Quality and time is my commitment. I have done this many times, I want to say that, I will start right now and funds will be after your satisf

$120 USD in 1 day
(1 review)
1.3
hontaroleksa

Hello. Oleksandr is here. I am interested in your project and I am sure that I can complete your project. I have +3 years of experience in web scraping, automation with python. I have done a lot's of scraping, web, des

$140 USD in 7 days
(0 reviews)
0.0
sojibahammedwd

Hi, I can help you as your instruction. Please contact with me. I am interestedly to do the job. Thank you.

$278 USD in 6 days
(0 reviews)
0.0
Yco89

Dear client, my name is Yesi Cortes, I have read your project and I have a lot of experience in handling Excel, I can make your spreadsheet the way you request it and in the time required and in total I can carry out t

$140 USD in 7 days
(0 reviews)
0.0
Aliistek

Hi, I have been working a global company as computer engineer. I had worked a lot of excel/data entry projects. I can work on your jobs and can finish as soon as possible.

$120 USD in 7 days
(0 reviews)
0.0