Delete multiple columns using awk or sedsplit string with awk and delimiterUsing Regex Breaking a text on the last digit using linux tools like sed, or awkcsv file adding and removing characters from rowsCount the number of unique values based on two columns in a spreadsheetProblem extracting data from file using awkReplacing a Substring with sedawk - compare 2 files and print columns from both filesBash help: awk columnsRemoving multiple space using sedDelete 'N' no lines only on the Nth occurrence of a pattern in a file using the sed/awk command
Does "he squandered his car on drink" sound natural?
C++ check if statement can be evaluated constexpr
What kind of floor tile is this?
What (the heck) is a Super Worm Equinox Moon?
"It doesn't matter" or "it won't matter"?
Circuit Analysis: Obtaining Close Loop OP - AMP Transfer function
Is this part of the description of the Archfey warlock's Misty Escape feature redundant?
How do you make your own symbol when Detexify fails?
What does "Scientists rise up against statistical significance" mean? (Comment in Nature)
Is it allowed to activate the ability of multiple planeswalkers in a single turn?
Shouldn’t conservatives embrace universal basic income?
Did the UK lift the requirement for registering SIM cards?
What fields between the rationals and the reals allow a good notion of 2D distance?
How do I tell my boss that I'm quitting soon, especially given that a colleague just left this week
Quoting Keynes in a lecture
What is the difference between lands and mana?
Make a Bowl of Alphabet Soup
What is the highest possible scrabble score for placing a single tile
I found an audio circuit and I built it just fine, but I find it a bit too quiet. How do I amplify the output so that it is a bit louder?
The IT department bottlenecks progress, how should I handle this?
When were female captains banned from Starfleet?
Why Shazam when there is already Superman?
Is there a RAID 0 Equivalent for RAM?
What is Cash Advance APR?
Delete multiple columns using awk or sed
split string with awk and delimiterUsing Regex Breaking a text on the last digit using linux tools like sed, or awkcsv file adding and removing characters from rowsCount the number of unique values based on two columns in a spreadsheetProblem extracting data from file using awkReplacing a Substring with sedawk - compare 2 files and print columns from both filesBash help: awk columnsRemoving multiple space using sedDelete 'N' no lines only on the Nth occurrence of a pattern in a file using the sed/awk command
I have a database with 6037 space-separated columns and 450 rows like the one below:
1807 1452 1598 1 6.655713 A B A B ... 0
1808 1452 1763 1 9.362033 0 0 A B ... A
1809 1452 1527 2 6.728534 A B A A ... B
1810 1452 1367 2 9.4055 A B A A B ... A
... ... ... ... ... ... ... ... ... ...
1812 1452 1258 1 6.363032 0 0 A B ... B
I want to get a new database with only the first 676 columns.
Preferably, some form that uses awk
or sed
command.
text-processing sed awk
New contributor
add a comment |
I have a database with 6037 space-separated columns and 450 rows like the one below:
1807 1452 1598 1 6.655713 A B A B ... 0
1808 1452 1763 1 9.362033 0 0 A B ... A
1809 1452 1527 2 6.728534 A B A A ... B
1810 1452 1367 2 9.4055 A B A A B ... A
... ... ... ... ... ... ... ... ... ...
1812 1452 1258 1 6.363032 0 0 A B ... B
I want to get a new database with only the first 676 columns.
Preferably, some form that uses awk
or sed
command.
text-processing sed awk
New contributor
The delimiter is the space.
– andrec
1 hour ago
add a comment |
I have a database with 6037 space-separated columns and 450 rows like the one below:
1807 1452 1598 1 6.655713 A B A B ... 0
1808 1452 1763 1 9.362033 0 0 A B ... A
1809 1452 1527 2 6.728534 A B A A ... B
1810 1452 1367 2 9.4055 A B A A B ... A
... ... ... ... ... ... ... ... ... ...
1812 1452 1258 1 6.363032 0 0 A B ... B
I want to get a new database with only the first 676 columns.
Preferably, some form that uses awk
or sed
command.
text-processing sed awk
New contributor
I have a database with 6037 space-separated columns and 450 rows like the one below:
1807 1452 1598 1 6.655713 A B A B ... 0
1808 1452 1763 1 9.362033 0 0 A B ... A
1809 1452 1527 2 6.728534 A B A A ... B
1810 1452 1367 2 9.4055 A B A A B ... A
... ... ... ... ... ... ... ... ... ...
1812 1452 1258 1 6.363032 0 0 A B ... B
I want to get a new database with only the first 676 columns.
Preferably, some form that uses awk
or sed
command.
text-processing sed awk
text-processing sed awk
New contributor
New contributor
edited 1 hour ago
dessert
24.7k672105
24.7k672105
New contributor
asked 2 hours ago
andrecandrec
61
61
New contributor
New contributor
The delimiter is the space.
– andrec
1 hour ago
add a comment |
The delimiter is the space.
– andrec
1 hour ago
The delimiter is the space.
– andrec
1 hour ago
The delimiter is the space.
– andrec
1 hour ago
add a comment |
2 Answers
2
active
oldest
votes
If the column delimiter in your file is a single character, e.g. a space, cut
can do that easily:
cut -d' ' -f-676 <in >out
This prints only the space-separated columns from the first to the 676th.
If you need e.g. every whitespace character to count as a delimiter, a sed
solution is:
sed -r 's/s+S+//677g' <in >out
This replaces every column (= at least one whitespace character followed by at least one non-whitespace character) beginning with the 677th with nothing. Using character groups you can specify any set of delimiters you need, e.g. for “4”, “#” and “K”:
sed -r 's/[4#K]+[^4#K]+//677g' <in >out
For a reasonable awk
approach kindly refer to steeldriver’s answer, but here is another one looping over the columns and only printing them (separated by FS
) if their number is <= 676:
awk 'for (i=1;i<=676;i++) printf (i==1?"":FS)$i; print ""' <in >out
For a character group you have to specify the output field separator for the output, e.g. for [4#K]
and "sep"
:
awk -F'[4#K]' 'for (i=1;i<=676;i++) printf (i==1?"":"sep")$i; print ""' <in >out
add a comment |
For a single-character delimiter (such as space or comma) I would recommend using the cut
command over either awk
or sed
.
However since you asked about awk
specifically, I think a reasonable way to do it would be to decrement the field count:
awk -v last=676 'while(NF>last) NF-- 1' datafile
Tested in GNU Awk (gawk
) and mawk
.
add a comment |
Your Answer
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "89"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
andrec is a new contributor. Be nice, and check out our Code of Conduct.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
var $window = $(window),
onScroll = function(e)
var $elem = $('.new-login-left'),
docViewTop = $window.scrollTop(),
docViewBottom = docViewTop + $window.height(),
elemTop = $elem.offset().top,
elemBottom = elemTop + $elem.height();
if ((docViewTop elemBottom))
StackExchange.using('gps', function() StackExchange.gps.track('embedded_signup_form.view', location: 'question_page' ); );
$window.unbind('scroll', onScroll);
;
$window.on('scroll', onScroll);
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2faskubuntu.com%2fquestions%2f1127670%2fdelete-multiple-columns-using-awk-or-sed%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
2 Answers
2
active
oldest
votes
2 Answers
2
active
oldest
votes
active
oldest
votes
active
oldest
votes
If the column delimiter in your file is a single character, e.g. a space, cut
can do that easily:
cut -d' ' -f-676 <in >out
This prints only the space-separated columns from the first to the 676th.
If you need e.g. every whitespace character to count as a delimiter, a sed
solution is:
sed -r 's/s+S+//677g' <in >out
This replaces every column (= at least one whitespace character followed by at least one non-whitespace character) beginning with the 677th with nothing. Using character groups you can specify any set of delimiters you need, e.g. for “4”, “#” and “K”:
sed -r 's/[4#K]+[^4#K]+//677g' <in >out
For a reasonable awk
approach kindly refer to steeldriver’s answer, but here is another one looping over the columns and only printing them (separated by FS
) if their number is <= 676:
awk 'for (i=1;i<=676;i++) printf (i==1?"":FS)$i; print ""' <in >out
For a character group you have to specify the output field separator for the output, e.g. for [4#K]
and "sep"
:
awk -F'[4#K]' 'for (i=1;i<=676;i++) printf (i==1?"":"sep")$i; print ""' <in >out
add a comment |
If the column delimiter in your file is a single character, e.g. a space, cut
can do that easily:
cut -d' ' -f-676 <in >out
This prints only the space-separated columns from the first to the 676th.
If you need e.g. every whitespace character to count as a delimiter, a sed
solution is:
sed -r 's/s+S+//677g' <in >out
This replaces every column (= at least one whitespace character followed by at least one non-whitespace character) beginning with the 677th with nothing. Using character groups you can specify any set of delimiters you need, e.g. for “4”, “#” and “K”:
sed -r 's/[4#K]+[^4#K]+//677g' <in >out
For a reasonable awk
approach kindly refer to steeldriver’s answer, but here is another one looping over the columns and only printing them (separated by FS
) if their number is <= 676:
awk 'for (i=1;i<=676;i++) printf (i==1?"":FS)$i; print ""' <in >out
For a character group you have to specify the output field separator for the output, e.g. for [4#K]
and "sep"
:
awk -F'[4#K]' 'for (i=1;i<=676;i++) printf (i==1?"":"sep")$i; print ""' <in >out
add a comment |
If the column delimiter in your file is a single character, e.g. a space, cut
can do that easily:
cut -d' ' -f-676 <in >out
This prints only the space-separated columns from the first to the 676th.
If you need e.g. every whitespace character to count as a delimiter, a sed
solution is:
sed -r 's/s+S+//677g' <in >out
This replaces every column (= at least one whitespace character followed by at least one non-whitespace character) beginning with the 677th with nothing. Using character groups you can specify any set of delimiters you need, e.g. for “4”, “#” and “K”:
sed -r 's/[4#K]+[^4#K]+//677g' <in >out
For a reasonable awk
approach kindly refer to steeldriver’s answer, but here is another one looping over the columns and only printing them (separated by FS
) if their number is <= 676:
awk 'for (i=1;i<=676;i++) printf (i==1?"":FS)$i; print ""' <in >out
For a character group you have to specify the output field separator for the output, e.g. for [4#K]
and "sep"
:
awk -F'[4#K]' 'for (i=1;i<=676;i++) printf (i==1?"":"sep")$i; print ""' <in >out
If the column delimiter in your file is a single character, e.g. a space, cut
can do that easily:
cut -d' ' -f-676 <in >out
This prints only the space-separated columns from the first to the 676th.
If you need e.g. every whitespace character to count as a delimiter, a sed
solution is:
sed -r 's/s+S+//677g' <in >out
This replaces every column (= at least one whitespace character followed by at least one non-whitespace character) beginning with the 677th with nothing. Using character groups you can specify any set of delimiters you need, e.g. for “4”, “#” and “K”:
sed -r 's/[4#K]+[^4#K]+//677g' <in >out
For a reasonable awk
approach kindly refer to steeldriver’s answer, but here is another one looping over the columns and only printing them (separated by FS
) if their number is <= 676:
awk 'for (i=1;i<=676;i++) printf (i==1?"":FS)$i; print ""' <in >out
For a character group you have to specify the output field separator for the output, e.g. for [4#K]
and "sep"
:
awk -F'[4#K]' 'for (i=1;i<=676;i++) printf (i==1?"":"sep")$i; print ""' <in >out
edited 1 hour ago
answered 2 hours ago
dessertdessert
24.7k672105
24.7k672105
add a comment |
add a comment |
For a single-character delimiter (such as space or comma) I would recommend using the cut
command over either awk
or sed
.
However since you asked about awk
specifically, I think a reasonable way to do it would be to decrement the field count:
awk -v last=676 'while(NF>last) NF-- 1' datafile
Tested in GNU Awk (gawk
) and mawk
.
add a comment |
For a single-character delimiter (such as space or comma) I would recommend using the cut
command over either awk
or sed
.
However since you asked about awk
specifically, I think a reasonable way to do it would be to decrement the field count:
awk -v last=676 'while(NF>last) NF-- 1' datafile
Tested in GNU Awk (gawk
) and mawk
.
add a comment |
For a single-character delimiter (such as space or comma) I would recommend using the cut
command over either awk
or sed
.
However since you asked about awk
specifically, I think a reasonable way to do it would be to decrement the field count:
awk -v last=676 'while(NF>last) NF-- 1' datafile
Tested in GNU Awk (gawk
) and mawk
.
For a single-character delimiter (such as space or comma) I would recommend using the cut
command over either awk
or sed
.
However since you asked about awk
specifically, I think a reasonable way to do it would be to decrement the field count:
awk -v last=676 'while(NF>last) NF-- 1' datafile
Tested in GNU Awk (gawk
) and mawk
.
edited 1 hour ago
answered 1 hour ago
steeldriversteeldriver
69.8k11114186
69.8k11114186
add a comment |
add a comment |
andrec is a new contributor. Be nice, and check out our Code of Conduct.
andrec is a new contributor. Be nice, and check out our Code of Conduct.
andrec is a new contributor. Be nice, and check out our Code of Conduct.
andrec is a new contributor. Be nice, and check out our Code of Conduct.
Thanks for contributing an answer to Ask Ubuntu!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
var $window = $(window),
onScroll = function(e)
var $elem = $('.new-login-left'),
docViewTop = $window.scrollTop(),
docViewBottom = docViewTop + $window.height(),
elemTop = $elem.offset().top,
elemBottom = elemTop + $elem.height();
if ((docViewTop elemBottom))
StackExchange.using('gps', function() StackExchange.gps.track('embedded_signup_form.view', location: 'question_page' ); );
$window.unbind('scroll', onScroll);
;
$window.on('scroll', onScroll);
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2faskubuntu.com%2fquestions%2f1127670%2fdelete-multiple-columns-using-awk-or-sed%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
var $window = $(window),
onScroll = function(e)
var $elem = $('.new-login-left'),
docViewTop = $window.scrollTop(),
docViewBottom = docViewTop + $window.height(),
elemTop = $elem.offset().top,
elemBottom = elemTop + $elem.height();
if ((docViewTop elemBottom))
StackExchange.using('gps', function() StackExchange.gps.track('embedded_signup_form.view', location: 'question_page' ); );
$window.unbind('scroll', onScroll);
;
$window.on('scroll', onScroll);
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
var $window = $(window),
onScroll = function(e)
var $elem = $('.new-login-left'),
docViewTop = $window.scrollTop(),
docViewBottom = docViewTop + $window.height(),
elemTop = $elem.offset().top,
elemBottom = elemTop + $elem.height();
if ((docViewTop elemBottom))
StackExchange.using('gps', function() StackExchange.gps.track('embedded_signup_form.view', location: 'question_page' ); );
$window.unbind('scroll', onScroll);
;
$window.on('scroll', onScroll);
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
var $window = $(window),
onScroll = function(e)
var $elem = $('.new-login-left'),
docViewTop = $window.scrollTop(),
docViewBottom = docViewTop + $window.height(),
elemTop = $elem.offset().top,
elemBottom = elemTop + $elem.height();
if ((docViewTop elemBottom))
StackExchange.using('gps', function() StackExchange.gps.track('embedded_signup_form.view', location: 'question_page' ); );
$window.unbind('scroll', onScroll);
;
$window.on('scroll', onScroll);
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
The delimiter is the space.
– andrec
1 hour ago